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Lessons from Antarctica 


Twenty years on, the success of the Montreal Protocol can help inform plans to mitigate climate change. 


hole in Earth’s protective ozone layer sparked an international 

effort that has become one of the great environmental success 
stories. The Montreal Protocol, which came into force 20 years ago 
this year, halted the production of chlorofluorocarbons (CFCs) — the 
main culprits causing the hole above Antarctica — and set the sched- 
ule for phasing out most other ozone-destroying substances. 

Today, stratospheric ozone levels are no longer in decline. And, 
although plenty of ozone-depleting chemicals remain in the atmos- 
phere, there are signs of recovery in ozone levels at the mid-latitudes. 
The rapid and committed response of the world’s nations has proba- 
bly averted a future in which unfiltered sunlight would have scorched 
humans, animals and plants alike (see page 792). 

Yet the Montreal Protocol is often written off as a model for curb- 
ing greenhouse-gas emissions. The international response to the 
ozone hole was fast — the protocol was ready for signing just two 
years after the hole was discovered — in part because the threat was 
immediate, and because chemical alternatives to CFCs had already 
been developed. 

The situation is less clear cut for global warming — itisa slow-moving, 


Te revelation in 1985 that chemical emissions had ripped a vast 


long-term problem for which there is no single alternative to fossil- 
fuel use. Nonetheless, the Montreal model has much it can offer the 
fight against climate change. Governments are increasingly wak- 
ing up to the urgency of the threat of global warming. And there is 
already an arsenal of alternative-energy technologies, ranging from 
wind turbines to energy-conserving building design, that could be 
delivered quite quickly around the world — if there was a political 
will to make it happen. 

The Montreal Protocol’s Multilateral Fund offers a particularly 
good model for such action. Industrialized nations, which have pro- 
duced the majority of the world’s CFCs, have contributed more than 
US$2 billion to the fund since it began operation in 1991 to help 
poorer nations meet their obligations under the treaty. Over time, 
this system has inspired a level of trust that is sorely lacking in today’s 
global climate negotiations. 

Above all, the Montreal Protocol serves as proof that the countries 
of the world can work together. And if nations, particularly those in 
the developed world, continue to take inspiration from its success, it 
could help divided governments find a common path towards tack- 
ling climate change. a 


A change of tone 


There is every reason to be optimistic about the 
Obama administration's attitude towards science. 


after the January inauguration of President Barack Obama 

has faded fast this summer, as the US capital has descended 
into partisan gridlock over issues such as health-care reform and 
financial regulation. 

Nevertheless, positive steps are quietly being taken. Last week, for 
example, White House science adviser John Holdren convened the 
first meeting of the 21-member President’s Council of Advisors on 
Science and Technology (PCAST, see page 785) — a star-studded 
assembly of academics and industry researchers. 

As with any such meeting in the early days of a new administration, 
the PCAST gathering was high on enthusiasm and necessarily low on 
evidence of effectiveness. But Holdren, who brings an extensive back- 
ground in nonproliferation and energy issues to his post, noted that 
the council’s first report, on the government's response to the HIN1 
flu pandemic, is already in the pipeline for public release within a 
few weeks. And the meeting’s lively discussions on topics such as 
electronic health records and research comparing the effectiveness of 
medical treatments suggested that members fully expect their advice 
to be heeded by the White House. 

Also last week, Holdren and Peter Orszag, director of the White 
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House Office of Management and Budget, signed a memorandum 
to top federal officials outlining the administration's priorities as the 
agencies begin to prepare their budget requests for fiscal year 2011. 
The memo emphasized the key part played by science and technol- 
ogy in those priorities — including economic recovery, health care, 
energy and climate. And it featured the telling phrase, “sound science 
should inform policy decisions”. 

That was something not seen too often during the administration of 
George W. Bush, which regularly sidelined science. Last week, a report 
from the Bipartisan Policy Center, a think tank in Washington DC 
established in 2007 by four former Republican and Democratic sena- 
tors, served as a reminder of how much work remains to be done to 
integrate science properly into political decision-making. The report 
makes recommendations that seem like common sense: proposed 
regulations should set out what science questions need to be answered, 
for instance, and there should be clearer conflict-of-interest rules for 
anyone appointed to a scientific advisory committee. Common sense 
on scientific matters was all too often lacking in Bush's Washington. 

Many challenges await Holdren in the coming months and years, 
including helping to convince Congress whether major invest- 
ments in science and technology are warranted, and then delivering 
progress reports that clearly delineate the success or failure of such 
investments. His White House team is far from complete; two of his 
office's four associate director positions — for science and for national 
security — remain empty and should be filled as quickly as possible 
with qualified nominees. But overall, science advice in the Obama 
era is off to a good start. a 
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RESEARCH AIGHLIGHTS 


CANCER BIOLOGY 


Suicide by nucleotide 


Cancer Cell 16, 103-114 (2009) 
A synthetic molecule that mimics double- 
stranded RNA viruses can trigger melanoma 
cells to digest and kill themselves. 

Melanoma cells often evade the immune 
system and deactivate cell-death pathways. 
But Maria Soengas at the Spanish National 
Cancer Research Centre in Madrid and her 
colleagues have found that the cells respond 
toa viral RNA mimic called polyinosine- 
polycytidylic acid, an adjuvant that boosts 
immunity. 

When administered alongside the 
carrier molecule polyethyleneimine, the 
complex caused melanoma cells — but not 
normal cells — to digest their organelles 
and eventually undergo programmed 
cell death. The process seems to happen 
without the help of other immune cells: the 
complex efficiently killed melanoma cells in 
immunocompromised mice. 


ATMOSPHERIC CHEMISTRY 
Isoprene's fate 


Science 325, 730-733 (2009) 

Isoprene emitted by trees has long been 
thought to form aerosols in the atmosphere, 
but how it does so was unclear. 

Fabien Paulot of the California Institute 
of Technology in Pasadena and his 
colleagues used chemical ionization mass 
spectrometry to monitor the photooxidation 
products resulting from the reaction of 
isoprene with hydroxyl radicals in an 
experimental environmental chamber. 

They found that isoprene initially forms 
hydroxyhydroperoxides. Surprisingly, further 
reaction with hydroxyl radicals results in 

the production of dihydroxyepoxides and 
reformation of the hydroxyls. These epoxides 
are readily taken up by acidic aerosols, 
forming tetraols and polymers. 

The researchers estimate that these 
expoxides account for nearly 100 teragrams 
of carbon in the atmosphere every year. 
Further understanding of this secondary 
aerosol formation should help to improve 
models of atmospheric chemistry. 


Stray genes 

Proc. Natl Acad. Sci. USA 
doi:10.1073/pnas.0902129106 
(2009) 


Semi-feral village dogs 
around the world carry 
complex mixtures of native 
and introduced genes, and 
could point to the origins of 
man’s best friend. 


University in Ithaca, New 
York, and his colleagues 
analysed mitochondrial and 
nuclear DNA in 318 dogs 
from seven areas of Africa 
and compared them with 
Puerto Rican street dogs, US 
mutts and representatives of 
126 recognized breeds. They 
were able to determine the 
mixture of indigenous and 


in each population. 

On the basis of 
mitochondrial DNA 
diversity, the authors 
urge closer scrutiny of 
recent suggestions that 
domesticated dogs have East 
Asian origins. The results also 
suggest that Pharaoh hounds 
and Rhodesian Ridgebacks, 
putatively African breeds, 
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Adam Boyko of Cornell non-native breed-dog genes have non-African origins. 
PHYSICS Brogioli estimates that the device could 
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Salt mined yield about 1.6 kilojoules per litre of fres 


Phys. Rev. Lett. 103, 058501 (2009) 

Mixing fresh water with salt water releases a 
substantial amount of energy, but scientists 
have struggled to find a way to harvest 

this inexpensively. Doriano Brogioli at the 
University of Milan-Bicocca, based in Monza, 
Italy, has proposed a new technique. 

He charged two porous carbon electrodes 
in salty water, then flushed the device with 
fresh water. Salt ions diffused away from the 
electrodes, increasing the voltage between 
them from 300 to 333 millivolts. Discharging 
the electrodes allowed energy to be extracted. 
(Apparatus and technique are detailed in 
graphic, below.) 
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water, which is on a par with the power 
produced by costly membrane-based 
methods. 


SPECIATION 


Multiplying effects 


Proc. Natl Acad. Sci. USA doi:10.1073/ 
pnas.0811575106 (2009) 

Duplications of the genome that result in 
‘polyploid’ organisms seem to be responsible 
for more plant-speciation events than was 
thought. 

Troy Wood of the University of Minster 
in Germany and his colleagues have used 
existing genetic data on vascular plants to 
establish the frequency of speciation events 
in which an increase in ploidy — the number 
of copies of the genome — occurred. They 
find that 15% of angiosperms and almost 
one-third of fern species are derived from 
a polyploid event that may have rendered 
a plant morphologically distinct from 
and sexually incompatible with its former 
brethren. This is four times higher than 
previous estimates and, the authors say, may 


still be a conservative figure. 

Although polyploidy seems to play an 
important part in creating new species, the 
team found no evidence that these polyploid 
plant lines go on to diversify more often than 
non-polyploids. 


NEUROBIOLOGY 


Have guts, get nerve 


J. Neurosci. 29, 9683-9699 (2009) 
Even in adulthood, gut neurons can renew 
themselves, scientists have found. 

Mintsai Liu at Columbia University in 
New York and her colleagues compared 
neurons in the guts of normal mice with 
those from mice lacking a receptor for 
serotonin (5-HT), a neurotransmitter 
associated with adult nerve generation in 
other parts of the body. 

They found that knocking out this 
receptor, 5-HT,, prevents the proliferation 
of neurons usually seen in the gut of 
newborn mice. In normal mice, drugs that 
activate 5-HT, receptors promote neuron 
survival and stimulate stem cells to make 
new neurons. 

The authors suggest that such drugs 
might be developed to repair damage to 
the enteric nervous system as might occur 
in inflammatory bowel disease. 


EVOLUTION 


Reinventing the egg 


Evolution doi:10.1111/j.1558-5646.2009.00790.x 
(2009) 
Many vertebrates have abandoned egg-laying 
for live birth. But despite some suggestive 
examples, convincing evidence for the 
evolutionary reversal of this trait has been 
lacking until now. 

Vincent Lynch and Ginter Wagner at 


Yale University, New Haven, carried out a 
phylogenetic analysis of 41 species of boa 
snake using recent DNA data. The most 
parsimonious explanation of the phylogeny, 
they conclude, is that the Arabian sand boa, 
Eryx jayakari (pictured below), one of only 
two species of egg-laying boa, re-evolved 
this ability some 60 million years after the 
transition of the group to live birth. 

E. jayakari also lacks the egg tooth other 
oviparous snakes use to tear their way out of 
the egg, additional evidence that egg-laying 
was lost and reacquired, according to the 
authors. 


SOIL ECOLOGY 


As different as day and night 


Biogeosciences 6, 1361-1370 (2009) 
When it comes to warming’s effects on plants, 
day and night are not equal. 

Shiqiang Wan and his collaborators at 
the Chinese Academy of Sciences’ Institute 
of Botany in Beijing set up 36 experimental 
plots on an Inner Mongolian steppe. They 
warmed some plots only during the day, 
others during night-time hours only, and yet 
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others around the clock. 

Plots warmed only at night turned the 
steppe from a net carbon source to a net 
carbon sink; the extra warming overnight 
stimulated respiration rates, boosting the 
plants’ daytime rate of photosynthesis and so 
their uptake of carbon dioxide. The effects 
of separate day- or night-time warming did 
not add up to equal the effects observed at 
constantly warmed plots. 

Many simulations of the effects of 
global warming on plants look at constant 
temperature elevations during a 24-hour 
cycle, but these may not be delivering the 
right picture. 


MOLECULAR BIOLOGY 


A regulator's regulator 


Genes Dev. doi:10.1101/gad.541609 (2009) 
Some small, non-coding RNAs regulate 
gene expression by binding to messenger 
RNA and preventing it from being 
translated into protein. The regulation of 
one bacterial gene, however, depends not 
just on one but on two non-coding RNAs, 
one of which inhibits the other. 

Lionello Bossi and his colleagues at the 
Centre for Molecular Genetics in Gif-sur- 
Yvette, France, studied the production of 
a bacterial protein called ChiP, which is 
involved in sugar uptake. In the absence of 
sugar, a small RNA called ChiX attaches to 
ChiP’s messenger RNA and prevents protein 
synthesis. Add sugar, however, and another 
RNA is produced that binds ChiX and targets 
it for destruction, allowing production of the 
ChiP protein. 

The discovery of this new feature of 
small RNAs in living cells could mean that 
regulatory RNAs identified computationally 
by sequence may in fact be targets of 
regulation. 
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Omar Tonsi Eldakar 
Center for Insect Science, 
University of Arizona 


An evolutionary biologist 
learns how to be remembered: 
cheat someone. 


What makes someone 
unforgettable? Is it their charm? 
Their looks? Or is it that they once 
stiffed you on the bill? 

Like many others, | have trouble 
remembering people's names, even 
as | am being introduced to them, 
but certain names remain etched in 


my mind forever. Few, for example, 
will forget Bernard Madoff, the 
New York financier convicted of 
defrauding people out of billions of 
dollars ina giant Ponzi scheme. 
Raoul Bell and Axel Buchner 
at the Institute of Experimental 
Psychology in Diisseldorf, 
Germany, have explored this bias 
in memory (R. Bell and A. Buchner. 
Evol. Psychol. 7, 317-330; 2009). 
They reveal that humans have a 
greater propensity to remember 
the names of individuals associated 
with cheating than names 
associated with trustworthiness or 
other unrelated behaviours. 


Cooperation is immensely 
beneficial to humans, but with 
cooperation looms the ever- 
present risk of exploitation. 
Researchers have proposed that 
humans have a specialized brain 
module dedicated to detecting 
and remembering cheaters, to 
help them to steer clear of future 
interactions with such individuals. 
It has previously been suggested 
that the cheater memory module 
is tied only to facial stimuli. 

But using the same behaviours 
associated with facial stimuli in 
previous studies, Bell and Buchner 
were able to replicate these 
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findings using only names, which 
suggests a more general module 
for remembering cheaters. 

Associating reputations with 
names is crucial to maintaining 
social norms through verbal 
mechanisms such as gossip. Thus 
memory bias for the names as 
well as the faces of cheaters could 
expand the ability of groups of 
individuals to avoid exploitation. 

Madoff probably won't have 
much luck if he tries to scam 
people again. 


Discuss this paper at http://blogs. 
nature.com/nature/journalclub 
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LHC experiments such as the Compact 
Muon Solenoid are still waiting to see 
particle collisions after the accelerator ‘K 
was left;crippled by anaccidentin t 
September 2008 (inset). 
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LHC hopes for collisions by Christmas 


Vol 460|13 August 2009 


But particle physicists will have to scale back the energies of their experiments for years. 


The Large Hadron Collider (LHC) should 
yield its first data by Christmas, smashing pro- 
tons at energies high enough to begin pushing 
back the boundaries of particle physics. But 
the world’s largest particle accelerator will 
only be operating at half the energy that it was 
originally designed for, and may not reach that 
peak until 2011 — ifat all. 

The new schedule was announced by 
CERN, Europe's particle-physics laboratory 
near Geneva, Switzerland, on 6 August. The 
6.5-billion Swiss franc (US$6-billion) LHC 
has been offline for nearly a year, following an 
accident on 19 September 2008 just nine days 
after the facility circulated its first particle 
beams to worldwide fanfare. When a faulty 
connection burnt out between two of the 
superconducting magnets used to accelerate 
particles, 8.7 kilo-amps of current arced out- 
wards in a massive short-circuit. The current 
ripped a hole in the pipe carrying liquid helium 
that surrounds the superconducting wire, 
spewing out soot and a blast of boiling helium, 
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which damaged a total of 53 magnets. 

During repairs this spring, a new set of prob- 

lems related to faulty magnet connections was 
discovered. Although CERN officials believe 
they now have the problem in hand, they 
are being cautious in ramping up the beams’ 
energies. In mid-November, protons will be 
injected into the 27-kilometre-long accelerator 
ring at low energies to show that 
two particle beams can circulate 
well in opposite directions. About 
four weeks later, the beams will be 
accelerated to about 3.5 teraelec- 
tronvolts (TeV) each, halfway to 
the LHC’s maximum beam energy. 
Smashing these beams together will create 
showers of exotic particles, and experiment- 
ers will begin gathering the data they need to 
calibrate their instruments. 

“We could have a Christmas present for the 
experimenters if were lucky,’ says Steve Myers, 
head of CERN’s accelerator department. Ifthe 
initial collisions go well, Myers says, engineers 
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will ramp up to energies between 4 and 5 TeV 
per beam sometime in 2010. The beams will 
then shut down in October or November 
2010 for six months in order to make further 
repairs and install more helium-release safety 
valves, before attempting to reach higher beam 
energies in 2011. 

Theorists say that LHC collisions with a 
total energy of 7 TeV should start to 
reveal previously unseen particles, 
although it might take a little longer 
than hoped to accumulate evidence 
for their existence. 

The standard model of parti- 

cle physics, describing the zoo of 
subatomic particles and the forces that con- 
trol them, begins to break down at energies 
above 1 TeV. Experiments at higher energies 
could reveal a whole new menagerie of par- 
ticles predicted by a popular successor to the 
standard model called supersymmetry. Also 
within reach are particles from a hypoth- 
esized form of dark matter, believed to make 
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up a quarter of the mass of the Universe. 
Finally, one of the most sought-after targets is 
the Higgs boson, the only particle predicted 
by the standard model that has not yet been 
found. The elusive boson is a marker of the 
Higgs mechanism, which could explain how 
particles have mass. 


Bad welds 

The 2008 accident was caused by a faulty weld 
in a section of superconducting wire connect- 
ing two magnets. The subsequent months of 
inspections of the rest of the LHC found just 
four more bad welds in this type of connec- 
tion — but also revealed a far more widespread 
problem. 

The superconducting wires are surrounded 
by copper wires, which act as ‘safety valves’ to 
carry any sudden surge in current. The copper 
wire will only come into play if the supercon- 
ducting wires warm up and lose their ability to 
conduct electricity without resistance. 

Tests over the past few weeks have revealed 
80 bad welds between sections of copper wire, 
but with roughly 10,000 copper-wire welds 
scattered around the LHC, not all have been 
inspected. This element of uncertainty means 
that CERN must increase the operating energy 
of the machine cautiously to avoid further acci- 
dents. 

The cost of repairs so far is 40 million Swiss 
francs. Myers acknowledges there was a quality- 
control problem with the welds, and that the 
systems for detecting current surges and miti- 
gating a catastrophic release of helium could 
have been improved. “It’s usually the simple 
things that cause you problems,’ he says. 

Peter Limon, a physicist at Fermi National 
Accelerator Laboratory in Batavia, Illinois, 
says these birth pains are “typical”. Limon 
points out that various problems meant that 
Fermilab’s Tevatron — currently the world’s 
highest-energy collider — took years to reach 
its maximum collision energy of 1.8 TeV. 
With a year or two of data collection needed 
for the LHC to make a definitive Higgs dis- 
covery, the Tevatron still has a chance at 
bagging it first. Tevatron physicists plan 
to present the latest update on their Higgs 
hunt on 18 August at the Lepton-Photon 
conference in Hamburg, Germany. 

Gordon Kane, a theorist at the University of 
Michigan in Ann Arbor, says that the physics 
community is frustrated by delays at the LHC, 
but adds that an extra year is not so long to hold 
on for the Higgs particle, given that its exist- 
ence was proposed in 1964. “I’m going to be 
there when it’s discovered,’ says the 72-year-old 
Kane, “no matter how long it takes.” a 
Eric Hand 
For more on the LHC, see www.nature.com/lhc 


Science advisers mull priorities 


An elite group of 21 US researchers met 
publicly for the first time last week as 
the new advisory panel to US President 
Barack Obama on scientific and technical 
matters. But despite an enthusiastic 
inaugural meeting, it will take time to 
know how effective the President’s Council 
of Advisors on Science and Technology 
(PCAST) will be. 

PCAST has already put together its 
first report, on the government’s H1N1 
pandemic strategy. Other topics likely to be 
high on its agenda include how science can 
help the economic recovery, and how best 
to deliver on Obama’s ambitious climate 
and energy research portfolio. 

Opening the meeting on 6 August, 
co-chair John Holdren called the council 
“a spectacular cast of leaders of our science, 
technology and innovation communities”. 
Holdren, who is Obama’s chief science 
adviser, chairs PCAST with Harold 
Varmus, former director of the National 
Institutes of Health in Bethesda, Maryland, 
and Eric Lander, director of the Broad 
Institute in Cambridge, Massachusetts. 
The full group boasts three Nobel laureates 
and 16 members of the national academies 
of science, engineering or 
medicine. 

The new PCAST “has 
a great membership and 
outstanding co-chairs, 
but its ability to influence 
events depends on 
who listens’, cautions 
John Marburger, who 


under Bush, the council launched in 
December 2001 with 24 members and 
expanded to 35 in 2005. PCAST’s previous 
incarnation focused more on technology 
advice, and it included fewer academic 
scientists and more business executives. 

“The best thing about this PCAST is 
that it’s up and running earlier in the term 
than the previous one,’ says Marburger. At 
the meeting last week, Holdren said that 
the council’s speedy formation was a clear 
signal that Obama thinks that science and 
technology are crucial in addressing global 
challenges. On 7 August, the council met 
with the president for a little over an hour. 

PCAST’s H1N1 report had been 
commissioned by Obama in late June. 
Shortly after, the council met for two days 
with around a dozen experts in public 
health, virology and other key fields. 

The report, which has not yet been made 
public, was delivered two weeks later and 
addressed the value of scenario planning, 
communicating complex messages to the 
public and policy-makers, and the types 
of legal, social, financial and other factors 
that could get in the way of responding 

to a pandemic. “Everyone felt that it was 
probably the best response 
ever to an epidemic event; 
says Lander. 

Last week’s meeting saw 
council members suggesting 
future subjects that could 
attract their scrutiny. 
Barbara Schaal, a plant 
geneticist at Washington 
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co-chaired the previous University in St Louis, 
incarnation of PCAST suggested weighing in on 

as the science adviser to the research agenda for the 
President George W. Bush. proposed National Institute 
PCAST?’s success or failure for Food and Agriculture, 
depends mainly on its John Holdren co-chairs the an institute authorized 
access to the president and = PCAST council. within the US Department 
on its interactions with of Agriculture but not 


various other advisory groups within the 
administration. These include the Office 
of Science & Technology Policy, which 
Holdren directs, and the National Science 
and Technology Council, composed 
mainly of the heads of agencies that deal 
with scientific matters, along with the US 
vice-president. 

Last November, Marburger and the 
other outgoing PCAST co-chair, venture 
capitalist Floyd Kvamme, left a memo 
offering their successors some advice, 
including limiting the number of members: 
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yet funded. Lander, also a geneticist, 
suggested looking at governmental 
approaches to cancer research, including 
evaluating whether the most creative 
approaches get support in their early days. 
And geochemist Daniel Schrag of Harvard 
University suggested tackling national 
strategies for research into adaptation to 
cope with the effects of climate change. 
The next PCAST meeting is slated for late 


October. P| 
Alexandra Witze and Lizzie Buchen 
See Editorial, page 781. 
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Ice-core researchers hope to chill out 


Fresh freezers needed to preserve ancient gas, scientists say. 


Researchers in the United States and Europe 
are seeking funding so that the ice cores used 
to study Earth's past climate can have the same 
luxuriously chilly storage facilities currently 
enjoyed by prize tuna. 

The cylindrical cores, drilled at multi-million- 
dollar expense from polar and glacial ice, can 
be kilometres long. They contain tiny bubbles 
of trapped air, allowing scientists to measure 
the concentration of carbon dioxide in the 
atmosphere from hundreds of thousands of 
years ago. The relative ratios of oxygen and 
nitrogen, and their isotopes, can also reveal 
temperature variations and help to date the 
trapped gas. 

But oxygen is gradually lost when ice cores 
are stored at —20°C to —30°C, the standard 
temperature of current cold-storage facilities 
in the United States and Europe (M. Bender, 
T. Sowers and V. Lipenkov J. Geophys. Res. 100, 
18651-18660; 1995). 

The oxygen diffuses slowly toward the core’s 
surface and can be lost to the air, resulting in 


Unless stored correctly, ice cores lose their value. 


higher ratios of nitrogen to oxygen that can dis- 
tort the conclusions scientists draw from them. 
In 2005, Japanese researchers showed that 
colder storage temperatures could mitigate this 
gas loss (T. Ikeda-Fukazawa et al. Earth Planet. 


Sci. Lett. 229, 183-192; 2005), and the ice-core 
storage facility at the National Institute of Polar 
Research, Tokyo, was subsequently upgraded 
with technology used to maintain cold stores 
for high-grade tuna at —50°C. 

US researchers are now seeking roughly 
$5 million to rebuild cold-storage facilities 
at the US Army Cold Regions Research and 
Engineering Laboratory in Hanover, New 
Hampshire, which would also have its storage 
capacity increased by about 25%. Ian Baker, a 
materials scientist at Dartmouth College, also 
in Hanover, is leading the application for fund- 
ing from the National Science Foundation. If 
the team gets its funding, Dartmouth would 
then manage the refurbished facility, which 
could be completed by 2010. 

European scientists face a similar challenge. 
“We need a European-wide facility,’ says Eric 
Wolff at the British Antarctic Survey in Cam- 
bridge, UK. But a European Commission study 
in 2008 estimated that a new —50°C facility, with 
associated research labs and teaching facilities, 
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Flu database rocked by legal row 


EXCLUSIVE 

A question mark is hanging over the future 

of EpiFlu, an international database created 

to help monitor the spread and evolution of 
influenza viruses. The database openly shares 
genetic, epidemiological and clinical data that 
previously had often been hoarded by countries 
and scientists, and is contributing to the rapid 
analysis of viral gene sequences from the current 
H1N1 pandemic. 

EpiFlu has become mired in a legal dispute 
between the Global Initiative on Sharing Avian 
Influenza Data (GISAID), an international group 
created by leading flu researchers in August 2006 
to promote data sharing (see Nature 442, 981; 
2006), and the Swiss Institute of Bioinformatics 
(SIB) in Geneva. GISAID announced a contract 
with the SIB in December 2006 to build the EpiFlu 
database. Most funding for EpiFlu came from the 
US Centers for Disease Control and Prevention 
and the Swiss government. 

The row became public on 27 July, when 
visitors to the EpiFlu landing page on the GISAID 
website were met with a message from the SIB 
informing them that the database was currently 
unavailable on that site “due to contractual and 
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legal issues", and instead was available only 
to users redirected to a SIB website. The same 
day, the SIB e-mailed the same message to all 
registered users of the database. 

The next day, GISAID also e-mailed users to 
assert that the SIB “had no right [to cut off the 
access to the EpiFlu Database via GISAID] or to 
operate the EpiFlu Database on its own”. 

Both SIB and GISAID officials declined to 
discuss the details because of the ongoing legal 
dispute. But the SIB alleges that GISAID has 
breached its contract by failing to pay its bills on 
time. GISAID officials say that although there 
had been a hiatus in its funding, 


She asserts that the SIB's continued operation 
of the database, including GISAID's trademark 
logos and data-access agreement, amounts to 
“misappropriation” of the database, and claims 
that under the terms of the contract, all EpiFlu 
data remain GISAID's property, giving the SIB no 
right to operate the database independently. 

However, the SIB had written to GISAID 
in September 2008 declaring that it was 
terminating their contract. The SIB argues that 
GISAID has not settled its bills in full and that, 
under Swiss law, a default on payment renders a 
contract null and void, giving the SIB the rights to 

the database it built. 


they had acted in good faith "| hope they reach Since then, the SIB claims that it 
and had subsequently obtained the best solution has operated the database without 
additional funds to meet some of asking for more money from 

for the sake of 


the bills. 


“GISAID continues to workwith global health.” 


the SIB to resolve their monetary 

dispute,” says Cheryl Bennett, an official at the 
GISAID Foundation’s Washington DC office. 
“However, before the monetary dispute can fairly 
be resolved, GISAID must receive an accurate 
accounting from the SIB, which GISAID has 
requested but the SIB has thus far not provided.” 
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GISAID, and has taken fund-raising 
for the database into its own hands. 
Although researchers can still 
access the database, public-health officials 
are worried by the longer-term implications of 
the row. Before the creation of GISAID, several 
countries — including Indonesia, an avian flu 
hotspot — had refused to share sequence data on 
the grounds that they got little in return. But they 
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$15,000, but the necessary freezer adaptations 
void warranties and there are concerns that 
keeping sections of ice cores in many different 
locations may hamper researchers’ access. 

His lab is currently studying oxygen/nitro- 
gen ratios in ice cores drilled last year for the 
Western Antarctic Ice Sheet Divide Ice Core 
project, a collaboration of US research groups. 
Preliminary comparisons with the Byrd ice 
core, drilled in 1968 in Western Antarctica, 
show that oxygen has been lost in the older 
core during storage, he says. a 
Rex Dalton 


were persuaded to share by GISAID's terms of 
access, similar to those of open-source software, 
in which all users agree to share their own data 
and give due credit to the originators. 

“Indonesia has been supportive of GISAID 
from the outset,” says Widjaja Lukito, a senior 
policy adviser at the country's ministry of health. 
He adds that he is watching “with great concern 
the problems faced by the GISAID Initiative 
during this pandemic phase". He says that he 
strongly hopes that “the parties involved will 
reach the best solution for the sake of global 
public health”. 

One flu scientist, who is a member of GISAID's 
scientific council, adds that EpiFlu is more than 
just the database computing infrastructure built 
by the SIB. Considerable effort has been invested 
by GSAID and its partners in “building trust" 
between the scientific community and countries 
who have sequence data to deposit, and in 
contributing influenza expertise. 

Lawyers for both the SIB and GISAID have in 
the past weeks made confidential settlement 
offers to each other, and both say they are keen 
to reach a peaceful settlement. “The SIB is 100% 
committed to the EpiFlu database as part of the 
GISAID initiative. We are working very hard to 
find the best possible solution for all concerned,” 
says Ron Appel, head of the SIB. | 
Declan Butler 


to monthly global surface 
temperature data held by 
the institute. But in recent 
weeks, Jones has been 


went to press, Jones was 
expected to post a statement 
on the CRU website to that 
effect, including any existing 


swamped by a sudden confidentiality agreements. 

surge in demands for data. Jones says any such data 
Several organizations release “needs to be done ina 

worldwide collect and systematic way”. 

report global average “We're trying to make 

temperature data for them all available,” says Jones. 

each month. Of these, Steve McIntyre wants access “We're consulting with all 

a temperature data set to more UK data. the meteorological services 

held jointly by CRU and — about 150 members [of 


the UK Met Office’s Hadley Centre in 
Exeter, known as HadCRU, extends back 
the farthest, beginning in 1850. Although 
these data are made available in a processed 
format that shows the global trend, access 
to the raw data is restricted to academics. 
Between 24 July and 29 July of this year, 
CRU received 58 freedom-of-information- 
act requests from McIntyre and people 
affiliated with Climate Audit, requesting 
access to the data or information about 
their use. In the past month, the UK Met 
Office, which receives a cleaned-up version 
of the raw data from CRU, has received ten 
requests of its own. 
McIntyre, based in 
Toronto, Ontario, is best 
known for questioning the 
validity of the statistical 
analyses used to reconstruct the past 1,000 
years of climate, but has more recently 
turned his attention to criticizing the 
quality of global temperature records. 
Jones concedes that raw climate data have 
imperfections — such as duplication 
of stations — but says that such minor 
errors would not alter the overall global 
temperature trend. McIntyre insists that 
he is not interested in challenging the 
science of climate change or in nit-picking, 
but is simply asking that the data be made 
available. “The only policy I want people to 
change is their data-access policy,” he says. 
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“Data release 
needstobedonein peter Webster, a hurricane 
a systematic way.” 


the World Meteorological Organization] 
— and will ask them if they are happy to 
release the data.” A spokesperson for the 
Met Office confirmed this, saying “we are 
happy for CRU to take the lead on this, as 
they are their data”. 

But getting the all-clear from other 
nations won't be without its challenges, 
says Jones, who estimates that it could 
take several months. In addition, some 
nations may object if they make money 
by selling their wind, sunshine and 
precipitation data. 

The dispute is likely to continue for 
some time. McIntyre is 
especially aggrieved that 


expert at the Georgia Institute 
of Technology in Atlanta, was 
recently provided with data that had 
been refused to him. 

Webster says his team was given the 
station data for a very specific request that 
will result in a joint publication with Jones. 
“Reasonable requests should be fulfilled 
because making data available advances 
science,” says Webster, “but it has to be an 
authentic request because otherwise you'd 
be swamped.” 

Indeed, Jones says he has become 
“markedly less responsive to the public 
over the past few years as a result of this”. 
Olive Heffernan 
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Return of the rat 


European investment could see knock-out rats catching up with mutant mice in medical research. 


The European Commission has 
approved the world’s first major systems- 
biology programme to study the rat. 

Known as EURATRANS — for Euro- 
pean large-scale functional genomics in 
the rat for translational research — the 
multimillion-euro project includes 
collaborators in the United States and 
Japan. The aim of the initiative is to 
expand databases of genes, proteins and 
other biomolecules, analysing the infor- 
mation and translating it into a form 
that is useful to clinical researchers. 

The effort represents a comeback 
for the rat, which fell from scientific 
prominence during the mouse-domi- 
nated genomics era, despite the unprec- 
edented amounts of physiological data 
that have been gathered from it over the 
centuries. 

“The rat is a better model than the 
mouse for many complex disorders that 
are so common in humans, like cardio- 
vascular and psychiatric disease,” says 


A Europe-led project could push the lab rat back into the spotlight. 


to unpick the many genes involved in 
these pathologies. 

But identifying genes is not enough 
to understand complex diseases, stresses 
Hiibner. “It is a question of what hap- 
pens to those genes.” For example, a 
recent genome-wide association study in 
thousands of people implicated the gene 
for the enzyme HMGCoA reductase as 
a risk factor in cardiovascular disease 
— but found the risk to be small’. Yet 
the statin drugs that target this enzyme 
are among the most effective for cardio- 
vascular disease. “That's why we need 
to integrate genomic information with 
other molecular information and learn 
more about entire molecular pathways 
in cells that may go wrong in disease,” 
says Hubner. 

The systems-biology approach of 
EURATRANS aims to integrate terabytes 
of information — on gene sequences, 
gene transcription and chemical modifi- 
cation, as well as proteomic and metabo- 


EURATRANS coordinator Norbert 
Hiibner, a geneticist at the Max Delbriick 
Center for Molecular Medicine in Berlin. “The 
project will help rat genetics catch up with the 
many-years head start that mouse genetics has 
enjoyed.” 

In the late 1980s, researchers developed 
a technique to knock out single genes from 
mice using embryonic stem cells as a starting 
material, making it possible for geneticists to 
engineer the mutant strains needed to model 
human disease. Rat genetics 
proved trickier to manipulate, 
and the mouse’s popularity as a 
lab animal soared. 

Two things have happened 
in the past few years that make 
a major assault on the rat feasi- 
ble and worthwhile, says Hiib- 
ner. “First, the huge advances in sequencing 
and other molecular technologies, and second, 
the problem which was preventing the devel- 
opment of gene knock-out technology in rats 
has been overcome.” Last year, Austin Smith, 
director of the Wellcome Trust Centre for 
Stem Cell Research at the University of Cam- 
bridge, UK, and his colleagues achieved this by 
altering the culture medium conditions used 
to grow embryonic cells in vitro’. Knock-out 
rats have not yet been created using these cells, 
but this is one of the aims of the EURATRANS 
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“The rat is a better 
model than the mouse 
for many complex 
disorders that are 
common in humans." 


project, in which Smith is also a participant. 

The 16 institutions participating in EURA- 
TRANS will receive a total of €10.5 million 
(US$14.9 million) from the commission, which 
they will match from their own resources. The 
money will be used to create detailed genome 
sequences of the eight progenitor strains of 
rat that gave rise to the stock rats generated 
at the US National Institutes of Health (NIH) 
laboratories in Bethesda, Maryland, the diverse 
genetics of which mirrors that 
of human populations. 

The project will also apply 
state-of-the-art technologies 
in order to generate data on rat 
functional genomics. Thirty 
additional ‘recombinant inbred 
strains’ created at the Prague- 
based Institute of Physiology, part of the 
Academy of Sciences of the Czech Republic, 
will also get this treatment. These rats were 
made by crossing the two strains of rat that 
have already had their genomes sequenced — 
the Norwegian brown rat (Rattus norvegicus)” 
and the albino spontaneously hypertensive 
rat, which develops high blood pressure early 
in life. The crosses display a range of clinical 
phenotypes, such as variations in blood pres- 
sure and glucose tolerance, and resistance to 
insulin, so studying them may help researchers 
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lomic data — in order to identify these 
molecular networks. The consortium will then 
confirm the networks’ involvement in disease 
using appropriately engineered rats. 

Smith points out that the growing use of 
mutant rats will come at a cost. Being larger 
than mice and taking longer to reach maturity, 
rats are more expensive to keep, although their 
size can also make them easier to work with. 
“Funding agencies will need to be persuaded 
that it is worth the investment,” he says. 

EURATRANS is one of the first projects 
to be approved under a reciprocal funding 
arrangement agreed in late 2008 between the 
European Union and the NIH. This means that 
a consortium does not have to prove, as it still 
does for other non-EU countries, that no labo- 
ratory with the same skills exists in Europe. 

EURATRANS participant Tadao Serikawa 
of Kyoto University in Japan, who heads the 
National BioResource Project—Rat initiative, 
says that the global nature of the EU project 
is important both for exchange of skills and 
information, and because “it avoids redundant 
experiments, which saves animals, time and 
money and accelerates biomedical progress”. m 
Alison Abbott 


1. Buehr, M. etal. Cell 135, 1287-1298 (2008). 

2. Rat Genome Sequencing Project Consortium Nature 428, 
493-521 (2004). 

3. Kathiresan, S. et al. Nature Genet. 40, 189-197 (2008). 
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Satellite data show 


Unsustainable water use in India is 
threatening agricultural production and 
raising the spectre of a major water crisis. 

Matthew Rodell of NASA’s Goddard 
Space Flight Center in Greenbelt, Maryland, 
and colleagues used 


data from the Gravity “If farmers could shift 
Recovery and Climate away from water-intensive 
Experiment (GRACE) Aol t 
satellites — operated by crops am mrp einen : 
NASA andtheGerman __ more efficient irrigation 


Aerospace Center 
(DLR) — to determine 
how groundwater levels are changing in 

the Indian states of Rajasthan, Punjab and 
Haryana, which includes the national capital 
of New Delhi. 

Their research, published online in Nature 
this week (M. Rodell et al. Nature doi:10.1038/ 
nature08238; 2009), found gravity anomalies 
suggesting a net loss of 109 cubic kilometres 
of water — equivalent to a mass of 109 billion 
tonnes — from August 2002 to October 2008. 


methods, that would help.” 


The amount lost is double the capacity of 
India’s largest surface-water reservoir, the 
Upper Wainganga, and almost three times the 
capacity of Lake Mead in Nevada, the largest 
reservoir in the United States. 

A second study using 
GRACE data, by scientists at 
the University of Colorado 
and the National Center for 
Atmospheric Research in 
Boulder, has found that the 
most intensively irrigated 
areas in northern India, 
eastern Pakistan and parts of Bangladesh 
are losing groundwater at an overall rate of 
54 cubic kilometres per year, consistent with 
Rodell’s results (V. M. Tiwari et al. Geophys. Res. 
Lett. doi:10.1029/2009GL039401; in the press). 

Groundwater depletion in northwest India 
isa known problem, but Rodell’s data suggest 
that the loss rate is around 20% higher than the 
Indian authorities have previously estimated. 

Rodell notes that rainfall during the 
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ndian water stocks shrinking 


study period was close to the long-term 
climatic mean, and says that the observed 
groundwater depletion is unlikely to be the 
result of unusual dryness or variability. 
The regions of Rajasthan, Punjab and 
Haryana have a combined population of 
114 million people, and receive an average 
of 500 millimetres of rainfall per year — 
just slightly less than that of London — but 
with pronounced seasonal and regional 
differences. Although less than a third 
of agricultural land there is irrigated, 
crop irrigation accounts for up to 95% of 
groundwater consumption. “If farmers could 
shift away from water-intensive crops, such as 
rice, and implement more efficient irrigation 
methods, that would help,” says Rodell. 
Meanwhile, the Indian government is 
looking into framing regulations to reduce 
groundwater consumption. “Hopefully,” 
says Rodell, “our research will give them the 
evidence they need to carry through.” a 
Quirin Schiermeier 
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Europe prepares for drugs 
from GM plants 


The first European guidelines on crop 
pharming — growing transgenic plants 
to produce drugs — were published 

on 7 August, clearing an obstacle to 
commercial development. 

The guidance was drawn up by the 
European Food Safety Authority in Parma, 
Italy, which will assess applications from 
scientists hoping to grow drug-producing 
plants on a commercial scale. The European 
Commission and member states will have 
the final say on all such plans. 

Developers must detail the differences 
between the modified plant and the 
conventional crop, assess the risks of genes 
being passed on to other plants, consider the 
potential dangers to humans and animals 
and specify monitoring procedures. This 
is consistent with rules set out by the US 
Food and Drug Administration and the 
US Department of Agriculture, says Jeffrey 
Wolt, an expert in plant biotechnology risk 
assessment at Iowa State University in Ames. 

Internationally, only a handful of clinical 
trials of plant-pharmed drugs are under way 
(see Nature 458, 951; 2009), and there are 
none on the market. 

For a longer version of this story, 
see http://tinyurl.com/m8uvgz 


Presidential panel lays out 
options for NASA's future 


The committee deliberating NASA’ future 
has outlined seven possible scenarios to 
shape US space operations for the coming 
decade. The ten-person panel, chaired 
by Norman Augustine, retired head of 
aerospace giant Lockheed Martin, set out its 
choices in a public meeting in Washington 
DC on5 August. 

Five of the seven scenarios would extend 
the life of the International Space Station 
by five years to 2020; three could allow the 
space shuttle to continue past 2011; but only 
two spell out a future for Ares I, the rocket 
being built to replace the shuttle as the main 
way of getting astronauts into space. 

The number of options presented to the 


Norman Augustine: laying plans for NASA. 


Novartis targeted by animal-rights extremists 


Militant animal-rights activists 
were responsible for the 
torching of the Alpine holiday 
home (pictured) of Daniel 
Vasella, head of the Swiss 
pharmaceutical company 
Novartis, in the early hours of 
3 August, Swiss authorities 
assume. The chalet was 
destroyed but no one was hurt. 
Novartis has reported at least 
ten other recent attacks on 
its employees. These include 
the desecration of graves of 
Vasella’s family and the theft 
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of his mother's ashes at the end of last month, as well as the placing of incendiary devices under the 
cars of Novartis’s administrative chief, Ulrich Lehner, in May. 

Swiss police say they expect further attacks. The Stop Huntingdon Animal Cruelty (SHAC) group 
has called an Action Week for 24-30 August against major customers of Huntingdon Life Sciences, 
acontract animal-testing company near Cambridge, UK, that has faced a sustained campaign of 
attacks. Novartis denies being a Huntingdon Life Sciences customer, as the SHAC website claims. 


president — whose administration has 
called for a reduced NASA budget — may 
change between now and 31 August, when 
the panel will deliver its final report. 

For alonger version of this story, 

see http://tinyurl.com/mrtrf5 


Batteries feel the benefit 
of green car money 


The US Department of Energy rolled out 
$2.4 billion in stimulus grants on 5 August 
to develop next-generation technologies for 
electric cars. 

The bulk of the awards, made to 
48 projects in more than 20 states, will fund 
factories to make battery components. 
Smaller pots of money go to producing 
electric motors and other drive components, 
and to fund road-testing of plug-in hybrids 
and all-electric cars. 

Established car makers and battery 
manufacturers took most of the money. 
Mark Platshon, a clean-technology analyst 
with VantagePoint Venture Partners in San 
Bruno, California, says he had hoped to see 
more handed to smaller, more innovative 
companies. “It’s scaling up the existing guys 
with the existing technology in the existing 
way,’ he says. 

For a longer version of this story, 
see http://tinyurl.com/nhuy7v 


Stalled science buildings 
restart in California 


The University of California (UC) last week 
came up with US$200 million to finance 
research and medical facilities despite a 
state-wide budget catastrophe (see Nature 
460, 441; 2009). 
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Of the ten UC campuses, eight will 
receive funds. Santa Cruz tops the list of 
research allotments, receiving $64 million 
for a biomedical science facility, Irvine 
wins $5 million for biological, engineering 
and computer sciences, and Riverside 
gets $4.6 million for equipment to stock 
newly completed materials-science and 
engineering buildings. 

Slashing $813 million last month from its 
$3.2-billion annual budget, the UC system 
has forced staff to take unpaid leave and had 
halted building work. The $200 million was 
made available on 4 August after the UC 
system secured a commercial bank loan to 
buy bonds from the state of California — 
which used the financing to restart stalled 
infrastructure projects. 


German scientists found 
guilty of negligence 


An independent investigation committee 
has found four scientists at the elite 
University of Géttingen, Germany, guilty 
of gross negligence because they included 
three unfinished manuscripts in publication 
lists supporting an application for research 
funding. But the researchers did not invent 
or falsify data, the committee says. 

The application was for the continuation 
of a collaborative research centre (SFB): a 
highly prized award funded by the DFG, 
Germany’s major national research agency. 
The collaboration was set up to study 
the stability of the Indonesian rainforest 
and had received €16.6 million (US$22.3 
million) from 2000 to June 2009 (see Nature 
459, 23; 2009). 

Investigations by the university and local 
public prosecutors into possible financial 
irregularities continue. 
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FIXING THE SKY 


When nations made plans to save the ozone layer, they didn’t factor in global warming. 
Quirin Schiermeier reports on how two environmental problems complicate each other. 


ater this month, something sinister will start to take 

shape above Antarctica. As sunlight reappears in the 

polar skies after the long winter, chlorine and bromine 

compounds in the stratosphere will begin destroying 
part of the ozone layer that shelters Earth’s surface from 
harmful ultraviolet radiation. Over the next few months, 
these pollutants will eliminate enough ozone to create a hole 
in the protective veil over the Antarctic continent. 

That same phenomenon has occurred every spring 
since the late 1970s, although it took several years before 
scientists recognized and documented the emerging pat- 
tern. When, in 1985, a team of researchers from the British 
Antarctic Survey published a paper in Nature describing 
the ozone hole’, the world was put on high alert. At about 
the same time, it was becoming clear that the ozone shield 
over much of the planet was vulnerable to pollutants emit- 
ted into the atmosphere. That concern propelled countries 
in 1987 to agree to phase out the production of ozone- 
destroying compounds — which stands out even today 
as the most ambitious action ever taken to tackle a global 
environmental problem. 

Thanks to that agreement, the Montreal Protocol, nations 
have made great strides towards repairing the planet's 
sunshield. Although the ozone hole that appears over Ant- 
arctica each year remains as bad as ever, the amount of 
chlorine in the atmosphere has started to decline and the 


Ina NASA simulation, 
a world with no 
controls on chlorine 
and bromine pollution 
leads to extreme 
ozone loss (blue hues) 
over Antarctica. 
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rest of the ozone layer has started to show signs of recovery 
from a less dramatic, but still dangerous, thinning. 

Amid the good news, however, lurk big questions about 
how long it will take to fix the sky. A decade ago, researchers 
projected that the ozone layer would fully recover by 2050, 
but now there is far more uncertainty in their estimates. 
One of the complicating factors is that greenhouse gases 
have altered atmospheric conditions in many ways since 
the Montreal Protocol was signed, some of which speed up 
ozone recovery and some of which delay it. 

Scientists are only now gaining the necessary computing 
power to run long-term simulations that allow them to test 
which effects of climate warming might win out. These types 
of studies suggest that part of the global ozone layer will 
recover decades earlier than previously thought, whereas 
the Antarctic ozone hole may linger decades longer than 
was once hoped. 


Complex interplay 

Just as the climate influences ozone, changes to the ozone 
layer will, in turn, alter the climate. Ozone loss over Ant- 
arctica has already affected the climate there, for example, 
by helping to warm the Antarctic Peninsula and thereby 
contributing to the destruction of several ice shelves. And 
it may have tilted the odds towards more frequent droughts 
and fires in Australia. 
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“Stratospheric ozone and surface climate are coupled in 
many ways,’ says Susan Solomon, an atmospheric chemist 
at the National Oceanic and Atmospheric Administration 
in Boulder, Colorado. “It’s a fascinating interplay of which 
we may not yet know the whole story.” 

Well before the ozone hole was recognized, scientists had 
started to worry about the effects that humans were having 
on the ozone layer. In 1974, Sherwood Rowland and Mario 
Molina, chemists at the University of California, Irvine, 
warned that chlorofluorocarbons (CFCs) could break down 
in the stratosphere and the chlorine released could destroy 
atmospheric ozone’. They later shared the 1995 Nobel Prize 
in Chemistry with Paul Crutzen of the Max Planck Insti- 
tute for Chemistry in Mainz, Germany, for their pioneering 
work in understanding ozone chemistry. 

Concern over that global thinning and the ozone hole 
led to the 1987 treaty known as the Montreal Protocol on 
Substances that Deplete the Ozone Layer. The treaty came 
into effect 20 years ago and was followed up by amend- 
ments that banned the worst ozone-destroying chemicals, 
such as CFCs (used in refrigeration, air-conditioning and 
foam production) and bromine-containing halons (used 
in fire extinguishers). The agreements yielded quick divi- 
dends: the effective concentration of ozone-destroying 
compounds in the stratosphere peaked in the late-1990s 
and has declined since then’. 

The Montreal provisions also came with a free gift for 
the climate because CFCs and their kin are much stronger 
greenhouse gases than carbon dioxide. By getting rid of 
those compounds, the ozone agreement has achieved five 
to six times greater reductions in warming effect than the 
Kyoto Protocol’. 

“The Montreal Protocol is just the most successful piece of 
international environmental legislation ever,’ says Solomon. 
“Tt also contains the unwritten memo to climate negotia- 
tors that we need to, and can, do much better in controlling 
greenhouse warming than we have been doing so far.” 


The ‘world avoided’ 

A glance at a hypothetical future with no ozone protections 
illustrates what could have been. Paul Newman of NASA‘s 
Goddard Space Flight Center in Greenbelt, Maryland, 
and his colleagues, used a model that simulates chemical 
reactions, atmospheric circulation and solar radiation to 
forecast a future in which production of ozone-depleting 
compounds is never regulated and grows at an annual rate 
of 3%. By 2065, two-thirds of the ozone would be destroyed 
— not just over the poles but everywhere®. CFCs would 
virtually eliminate the global ozone layer by the end of 
the century. People in New York, Buenos Aires and Tokyo, 
like everyone else living in mid-latitudes (the temperate 
regions between about 30° and 60° from the Equator), 


“If you have 

to have an 
ozone hole, the 
best place is 
Antarctica. In 
the tropics it 
would be very 


damaging to 
life." 
— Susan 
Solomon 


Controls on pollution 
are projected to 
prompt a recovery of 
the ozone hole later 
this century. 
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would be exposed to ultraviolet radiation so extreme that 
they would develop dangerous sunburn within 5 minutes, 
on average, one-third of the time it takes today. The level of 
DNA-mutating ultraviolet radiation would rise about six- 
fold, dramatically boosting skin cancer cases in humans 
(see ‘Ozone and cancer’). 

But the ozone treaties will spare humans from living ina 
world unshielded from the might of the Sun. “The worst has 
been avoided,” says Martin Dameris, an atmospheric and 
climate scientist at the German Aerospace Centre in Ober- 
pfaffenhofen near Munich. “But itis also a reminder that we 
must not even think about watering down the protocol.” 


Continued pollutant use 

Environmental groups worry in particular about methyl 
bromide, which is used in agriculture to control pests. 
Under the provisions of the ozone-protection treaties, 
developed nations were supposed to stop using this 
compound by 2005, with developing nations following 
10 years later. But lobbying by farming groups led to a 
stay of execution and the chemical is still in use in many 
developed countries. Another complicating factor is the 
large volume of chlorine and bromine stored in older air- 
conditioning and fire-fighting systems, much of which will 
eventually make its way into the atmosphere. 

Once the chemicals get there, they linger for decades. 
So even though CFCs and several other ozone destroy- 
ers have been phased out, they will continue to eliminate 
ozone for many years. At present, stratospheric ozone 
concentrations around the globe are about 4% below the 
1964-80 average, but the depletions differ substantially 
between hemispheres and latitudes. In tropical regions, 
there is relatively little ozone loss. Over the mid-latitudes, 
where the air mixes more readily with ozone-depleted air 
parcels from the poles, total ozone loss has reached 3% in 
the Northern Hemisphere, and around 6% in the Southern 
Hemisphere, since 1980 (ref. 3). 

Although relatively minor compared with the extreme 
ozone loss over Antarctica each spring, the mid-latitude 
changes have a notable effect because so much of the popu- 
lation lives there. The additional ultraviolet light may cause 
hundreds of thousands of extra cases of skin cancer each 
year worldwide, with the full effects still decades away’. 

But there is good news for the mid-latitudes, because 
ozone concentrations there have started to show signs of 
rebounding. “It seems as if mid-latitude ozone is going 
up,” says Richard Stolarsky, an atmospheric chemist at 
Goddard. The upward trend is still not clear enough to be 
able to firmly establish its cause, he says, “but we're pretty 
confident that reduced chlorine load [in the atmosphere] 
is contributing to the rebound”. 

Although researchers once estimated that it would take 
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until 2050 for the mid-latitude ozone layer to ~OZONE'S LONG JOURNEY which effect will win out. The results 
recover fully, model simulations now suggest | Ozone molecules are produced in the stratosphere over the tropics ~~ are not consistent from one model to 
that point will come perhaps 20 years ear- a are then transported towards thevpoles where they sink intothe: the next and there is still considerable 
i i . . ower stratosphere. Over Antarctica, a vortex of winds isolates the 7 fs : 
lier, especially in the Northern Hemisphere, jar stratosphere during winter and spring, which bottles up the uncertainty on this topic, says Darryn 
which is not greatly affected by the Antarc- cold air there and fosters the formation of springtime ozone holes. Waugh, an atmospheric scientist at 
tic ozone hole*®. The faster recovery can be Johns Hopkins University in Baltimore, 
considered a benefit of global warming. Maryland. 
As greenhouse gases trap heat in the lower One factor holding back progress in 
atmosphere, they cool the stratosphere this area is the time it takes to conduct 
above, which slows down the ozone- the research. There are still relatively 
destroying chemical reactions. More- few groups with the computing power 
over, models that simulate climate to run long simulations with sufficiently 
and atmospheric chemistry suggest complex models that mimic both 
that global warming will speed up atmospheric chemistry and climate. In 
the circulation pattern that carries the case of the Princeton lab, it takes 
ozone-rich air from the tropics to 3 months of continuous computing 
the mid-latitudes, thereby boosting with a 100-processor system to con- 
ozone amounts there. ducta 100-year simulation, says Austin. 
“Thave to watch it every day,’ he says, to 
make sure that the experiment does not 
get stuck. “Otherwise three months turns 
into four or five.” 
For now, the ozone hole above Antarc- 
tica shows no sign of getting any better. In 
2002 and 2004 the ozone loss was less severe, 
Polar vortex Dut in 2006 ozone levels fell to a new record 
low and have since remained depressed. 
Most scientists don’t expect the recovery to 
start until at least 20 years from now — a decade 
or so later than was projected just five years ago. 


Tropical 


Quick reactions 

Closer to the poles, however, 
the influence of global warming 
becomes less clear, especially for the 
Antarctic ozone hole. Cooling in the 
stratosphere there would, by itself, lead 
to more ozone destruction by stimulat- 
ing the growth of polar stratospheric 
clouds. In the extremely dry stratosphere 
above Antarctica, such clouds form when 
temperatures drop below —78°C, and they pro- 


Antarctica 


vide a surface for the chemical reactions that rapidly strip an That is because models that simulate the movement 
oxygen atom from the three-oxygen ozone molecule. of chemicals suggest that the stratosphere over Antarctica 
The ice particles play a powerful part. Outside the polar will remain saturated with ozone-destroying substances for 
regions, a chlorine atom in the stratosphere can eliminate another 10-20 years. And according to many models, it will 
only a few hundred ozone molecules before it reacts with take until 2060 or 2065 for nearly complete recovery, when 
other gases such as nitrous oxide, breaking the cycle. But polar concentrations of ozone-destroying compounds drop 
on the surface of polar ice particles, it can catalytically con- below their 1980s values. 
vert tens of thousands of ozone molecules. And it happens The latest modelling results have raised an additional 
quickly. In the heart of the Antarctic ozone hole, between 14 complicating factor: the influence of bromine compounds. 
and 21 kilometres above the surface, the loss rate can reach Concentrations of bromine in the stratosphere are higher 
up to 3% per day”. By the beginning of October, the cloud- than models suggest they should be, perhaps because more 
mediated reactions destroy nearly every ozone molecule in bromine is making its way into the stratosphere than had 
that altitude band. been predicted. If that trend continues, a small ozone hole 
In 1992, researchers predicted that greenhouse warming — about one-tenth its current size — would continue to 
would speed up the destruction so strongly that it would form for decades beyond 2065, according to work done by 
cause ozone holes to open above the Arctic as well’. But "Ozone the Princeton lab. “Even by the end of the century, you don't 


that analysis left out an important effect, says John Austin, ; see complete recovery,’ says Austin, who cautions that this 
an author of the study and a modeller at the Geophysi- chemistry and result is still preliminary. 
cal Fluid Dynamics Laboratory in Princeton, New Jersey. atmospheric For humans, and for most animals and plants, the extra 


Normal atmospheric flow, called the Brewer-Dobson dynamics ultraviolet light let in by the ozone hole may not present 
circulation, causes air to rise into the stratosphere over the , a significant problem. The peak ozone loss occurs early 
tropics, and then travel towards the higher latitudes, where influence each in the spring, when the Sun is still low over the horizon, 
it sinks back into the lower atmosphere (and heats up as it other. It's which limits the amount of ultraviolet radiation reaching 


gets compressed). If climate change accelerates that cycle, difficult to keep the surface. 

it will speed up the downward flow above the polar regions, : “If you have to have an ozone hole, the best place is 
which would enhance the compression of the sinking air 4P4 rt what's Antarctica,’ says Solomon. “If it occurred in the tropics it 
and raise atmospheric temperatures there. Especially in pushin ig from would be very damaging to life” 

the Arctic, that heating effect in the polar stratosphere will what's pulling.” But aside from letting in slightly more radiation, the 
impede ozone loss, says Austin. : ozone hole has the indirect effect of altering the local cli- 


With climate change acting both to warm and to cool — Martin mate, which causes atmospheric changes that can be felt 
the polar stratosphere, researchers do not yet know Dameris throughout much of the Southern Hemisphere. Ozone 
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is normally an enormous source of heat 
for the stratosphere because it absorbs 
solar ultraviolet radiation. The absence 
of ozone throughout much of the polar 
stratosphere leads to a6 °C or so cooling there. 
Solomon has found that this cooling effect has 
helped to seal off the Antarctic continent by 
strengthening the vortex of westerly winds 
that flow around the polar cap’. 

This discovery serves as an explanation for 
several puzzling features of Antarctica’ cli- 
mate. The strong vortex locks very cold air on 
the high plateaus in the continent's interior 
and thus shelters the coldest regions on Earth 
from the effects of greenhouse warming. 
Large parts of Antarctica have experienced a 
cooling trend over the past 30 years — a fact 
often raised by those who deny any reality to 
global warming. But Solomon’s work shows 
that ozone loss helps to explain the lack of 
warming in the interior parts of Antarctica. 


Message ina bottle 

With the colder air bottled up over Antarctica, 
it does not flow out as frequently over the 
edges of the continent. That helps to explain 
why the Antarctic Peninsula, exposed to rela- 
tively mild oceanic air, has become one of the 
fastest-warming places on Earth. In the past 
couple of years, two large ice shelves have 
disintegrated along the peninsula, whereas 
six others show signs of retreat, sending gar- 


gantuan icebergs floating off into the Southern Ocean. 
So as the ozone hole shrinks during this century, it will 

uncork the air bottled up over the continent. “The polar 

vortex won't be as tight anymore, and cold interior air could 


Ozone and cancer 


THE TREATY THAT WORKED 


Without the Montreal Protocol and its amendments, 
the amount of ozone-destroying compounds would 
continue to rise, driving up skin-cancer rates. 
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make it to the peninsula more frequently,’ says 
Solomon. “That would help” 

Elsewhere around Antarctica, for instance 
in the Ross Sea, sea-ice coverage is growing. 
Solomon was among the first to suggest that 
stronger atmospheric circulation around Ant- 
arctica may be behind that seemingly puzzling 
trend as well. Indeed, model experiments 
have recently confirmed that the ozone hole 
alters circulation patterns in just such a way to 
explain the sea-ice expansion and the overall 
climate changes in the Antarctic’. 

Looking beyond Antarctica, James Risbey 
of the Center for Dynamical Meteorology 
and Oceanography at Monash University in 
Clayton, Australia, suggests that ozone loss 
is responsible for reduced rainfall and more 
frequent droughts over southern Australia. 
This happens, he says, because the cooling 
caused by ozone depletion over Antarctica 
has lowered atmospheric pressures there, 
speeding up the vortex of winds around the 
continent (see graphic, page 794). That pulls 
the rain-bearing westerly winds southwards, 
keeping them away from Australia. Risbey 
speculates that the ozone loss over Antarctica 
has caused a 20% reduction in rainfall in the 
dry season over Australia. 

“Ozone chemistry and atmospheric dynam- 
ics influence each other. That's why it’s diffi- 
cult to keep apart what’s pushing from what’s 
pulling,” says Dameris. “Much is speculation 


still, and there may be surprises.” 

His experiments with a model that simulates climate and 
atmospheric chemistry suggest that rising sea surface tem- 
peratures in the tropical oceans will strengthen the Brewer- 


Dobson circulation”®. That could result in less ozone inside 
the tropics and more ozone outside, leading to abnormally 
high ozone concentrations in the mid-latitudes. That extra 


ozone would help protect people but it could have unwanted 


With the thinning of 
global ozone layer in the 
past 30 years, a surge in 
skin-cancer rates would 
be expected. And at first 
glance, that seems to have 
happened. The incidence 
of melanoma in the United 
States has more than 
doubled since 1975, with 
almost 69,000 cases 
expected to be diagnosed 
this year. In some parts 
of Australia and New 
Zealand, which have the 
highest incidence and 
mortality of melanoma in 
the world, the numbers 
have jumped by more 
than 50% since 1994. 

However, attributing 
that trend to ozone 


depletion is difficult 
because skin-cancer 
rates depend on other 
factors such as lifestyle, 
sunscreen use and 
how often tumours are 
diagnosed, all of which 
have changed over time. 
It also takes decades for 
skin cancer to appear 
after exposure. So the 
peak in cancer from the 
past thinning of the ozone 
layer has not yet hit. 
Based on predicted 
increases in the amount 
of ultraviolet radiation, 
and assuming that factors 
other than ozone will 
remain unchanged, the 
World Meteorological 
Organization and United 


Nations Environment 
Programme predict that 
additional skin cancer 
cases will top out around 
mid-century at 100 or 
so cases per million per 
year above 1980 levels 
— which would translate 
to at least 700,000 extra 
cases worldwide. Those 
rates should eventually 
return to 1980 values. 
Without the Montreal 
Protocol and its 
subsequent provisions 
on chlorofluorocarbons, 
skin cancer cases would 
have been expected 
to quadruple by 2050 
and to sharply increase 
further thereafter? (see 


graphic, above). Qs. 


effects, perhaps by altering the chemistry of the stratosphere 


or by hindering the growth of plants, says Dameris. 


Over the long term, as chlorine and bromine pollution 
are cleaned from the skies, climate change will take over 
and be the dominant human influence on ozone concen- 
trations, he says. “Not many people have started thinking 


about what that might mean. It’s getting about time” 
Quirin Schiermeier is Nature's Germany correspondent. 


Additional reporting by Richard Monastersky. 
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Ape and human 
similarities can 
be deceptive 


SIR — In his Essay ‘Darwin's last 
laugh’ (Nature 460, 175; 2009), 
Frans de Waal suggests that ape 
vocalizations are homologous to 
human laughter, which they could 
be — but that does not necessarily 
imply that apes have a sense of 
humour. Adaptive divergence 
could be at play, making humans 
the ones who get the jokes. 

Darwin's singular contribution 
to the issues raised by de Waal 
is that, in spite of universal 
appreciation for the behavioural 
similarities between humans and 
other animals, these continuities 
can be divided into two classes. 
The first is a symbolic, or literary, 
relationship, familiar to us through 
Aesop: the ant is industrious, 
the fox is clever. The second is 
the result of a shared biological 
history, or common ancestry. 

Our scientific interest in the 
apes is presumably based on the 
expectation that the similarities 
we can identify between them 
and us are principally of the 
second type. Although critical 
to de Waal’s argument, this 

type of overlap is, unfortunately, 
much easier to assume than to 
demonstrate. 

The circumstances in which 
similarities between chimps 
and humans manifest often 
indicate the opposite — that 
the behaviours are not directly 
homologous and are being 
overenthusiastically interpreted. 
For example, infanticide by chimps 
is carried out by non-relatives (for 
reasons only they know) whereas 
in humans it may be carried out by 
the mother, or by someone acting 
on her behalf, mainly for cultural, 
social or economic reasons. In 
one species, the act is aggressive, 
violent and often cannibalistic, 
with no indication of the remorse 
that can accompany the act in the 
other species. 

And although our feet are 
homologous to ape feet, the 
chimp's foot is principally adapted 
for grasping and the human 
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foot to weightbearing. They are 
similar, and are descended froma 
common ancestral structure, but 
they are by no means the same. 
A genuine Darwinian approach 
to primate behaviour may have 
to acknowledge that the brains of 
apes (and their capabilities) may 
simply be different from our own, 
like their feet. Evolution, after all, 
is the production of difference. 
If one scholar acknowledges the 
adaptive divergence that has 
occurred between a human and 
achimp over 7 million years or so 
of separation, and another insists 
that they are the same, then who 
is really in denial of evolution? 
Jonathan Marks Department of 
Anthropology, University of North 
Carolina at Charlotte, Charlotte, 
North Carolina 28223, USA 
e-mail: jmarks@uncc.edu 


Speed of reporting 
isn't the issue when 
your work is scooped 


SIR — In your Editorial ‘How 

to stop blogging’ (Nature 460, 
152; 2009), you urge the 
conference community to 

be clear from the outset about 
whether their meetings are 
‘open’ or ‘closed’ with respect 
to the informal dissemination of 
their content. In my view (as a 
scientist and a blogger, http:// 
tinyurl.com/mf8ynx), this risks 
treating early adopters of new 
communication technologies 
as scapegoats. 

A few years ago, interested 
conference participants might 
have stood in front of a poster 
and sketched the figures for 
themselves; more recently, they 
might have snapped a digital 
photo while the presenter's 
back was turned. Key information 
could always be relayed by a 
swift phone call back to their 
laboratories. Now people can 
also tweet to the lab or the wider 
‘tweetersphere’. 

We present information in 
order to disseminate it and 
enhance our reputations. 
Whether this occurs within 


weeks, days or minutes is just a 
question of scale. 

If acompetitor can scoop me 
by tweeting or blogging my data 
only a few days or hours faster 
than he or she would have done 
by simply coming to my poster 
and taking ideas back to the 
laboratory, then the problem 
clearly lies with my own inability 
to conclude and publish my 
research — not in the progress 
of technology. 

For the most part, the use of 
this technology is limited to 
sharing information with like- 
minded individuals. The nature 
of the conference has not 
changed, only the mindset of 
the participants. 
lan M. Brooks Biomedical Informatics 
Unit, University of Tennessee 
Health Science Center Clinical and 
Translational Science Institute, 
Memphis, Tennessee 38163 , USA 
e-mail: ibrooks1@utmem.edu 


Readers are welcome to comment 
at http://tinyurl.com/kkfe6e 


European bodies 
can help to tackle 
TB worldwide 


SIR — We welcome your Editorial 
plea (‘Orphan giant’ Nature 

459, 1034; 2009) for stronger 
advocacy for tackling tuberculosis 
(TB), ashared global strategy 
and increased commitment to 
research and innovation. The 
emerging threat of drug-resistant 
TB in Europe, highlighted ina 
recent report from the European 
Academies Science Advisory 
Council (EASAC; www.easac.eu) 
could be an opportunity to bring 
European Union (EU) public- 
health objectives to bear ona 
shared global strategy. 

EASAC was formed by the 
national science academies of EU 
member states to provide advice 
to policy-makers. Its report 
indicates that the EU has failed so 
far to respond adequately to the 
global TB threat, despite now 
being on the frontline for infection 
control. There is still an 
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opportunity for strategic 
leadership: the EU is in a good 
position to draw on its 
considerable strengths in science, 
to develop effective coordinated 
public-health capabilities and to 
contribute to solving the problem 
on a global scale. 

The report recommends, for 
example, that the EU should 
improve its procedures for 
collecting, reporting and using 
its own TB-surveillance data. To 
help understand the molecular 
determinants of virulence and 
resistance, standardized 
characterization of pathogen 
strains and their drug sensitivity 
should be linked into user 
databases of genetic and clinical 
information. Better EU funding 
models for research and 
innovation should eventually 
generate new health-care 
products and services. 

The domestic public-health 
objectives for the EU can be 
directly aligned with global policy 
requirements. For example, 
policy-makers worldwide need 
to extend the ‘advanced market 
commitment’ — under which 
governments and other sponsors 
pay in advance for development 
of vaccines — to cover TB 
diagnostics, drugs and vaccines. 
However, to avoid repetition of 
previous problems caused by lack 
of coordination among multiple 
initiatives, there must also be 
more effective interaction among 
EU institutions, intergovernmental 
and non-governmental 
organizations. 

Robin Fears EASAC, The Royal Society, 
6-9 Carlton House Terrace, 

London SWTY 5AG, UK 

e-mail: robinfears@aol.com 
Alimuddin Zumla Centre for Infectious 
Diseases and International Health, 
University College London Medical 
School, London WIT 4JF, UK 

Volker ter Meulen EASAC and 
German Academy of Sciences 
Leopoldina, Postfach 110543, 

6019 Halle (Saale), Germany 


Contributions to this page may be 
sent to correspondence@nature. 
com. Please see the Guide to Authors 
at http://tinyurl.com/373jsv 
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Africa's biotechnology battle 


An influential book accuses Europe of keeping genetically modified crops out of Africa, but, by polarizing the 
debate, it undermines efforts to improve the continent's agriculture, warn lan Scoones and Dominic Glover. 


Starved for Science: How Biotechnology Is 
Being Kept Out of Africa 

by Robert Paarlberg 

Harvard University Press: 2008. 256 pp. 
$24.95 (hbk), $16.95 (pbk) 


Starved for Science is a troubling polemic. 
Political scientist Robert Paarlberg argues that 
genetically modified (GM) crops could solve 
Africa’s hunger and poverty, but that, through 
inadequate investment, external lobbying 
and stringent regulations, farmers are being 
deprived of the technology and prevented 
from achieving agricultural success. He lays 
the blame largely with European governments 
and non-governmental organizations for trying 
to foist their affluent values and precautionary 
sensibilities on Africa's poor. 

The book has quickly become influential. 
Paarlberg was asked to speak about hunger 
alleviation in front of the US Senate Commit- 
tee on Foreign Relations. The book’s arguments 
were repeated in a major policy speech by Nina 
Fedoroff, science and technology adviser both 
to the US Secretary of State and to the admin- 
istrator of the US Agency for International 
Development (USAID). Economist Paul 
Collier of the University of Oxford, UK, praised 
it in an article in the journal Foreign Affairs, 
and British peer Dick Taverne described it in 
a House of Lords debate as one of the most 
important books he had read in years. 

But Paarlberg’s account is one-sided. Just as 
the heated debate about GM crops had settled 
around a position that recognizes they can 
be useful in some circumstances yet are not a 
panacea, this book unhelpfully polarizes the 
matter once more. 

Large parts of Starved for Science are 
uncontroversial. Paarlberg is correct in say- 
ing that there has been long-term under- 
investment in African agriculture, especially 
in scientific research and technology devel- 
opment. And few would dispute that funding 
agricultural research offers high returns and is 
a key weapon in the fight against poverty and 
hunger. That such arguments have been 
ignored by policy-makers and aid programmes 
is also well recognized. 

Where we take issue with the book is its 
explicit assertion that the only kind of science- 
based agriculture worth investing in is founded 
on biotechnology, and on genetic engineering in 
particular. In its narrow focus, Starved for Science 


EADY 08) (ses 


Transgenic crops are not a panacea in Africa, where low-tech farming methods can yield good results. 


dismisses a slew of scientifically validated 
approaches to agriculture, including integrated 
pest and soil-fertility management, ‘low-input’ 
techniques that reduce reliance on synthetic 
fertilizers and pesticides, and even other forms 
of biotechnology. Yet such methods have per- 
formed well in African contexts (J. N. Pretty et al. 
Environ. Sci. Technol. 40, 1114-1119; 2006). 
Paarlberg’s book reduces the vast and var- 
ied continent and its farmers to a series of 
gloomy generalizations. But a detailed look 
at the data reveals numerous successes. For 
example, in northern and western Africa, 
agricultural production per capita increased 
by more than 40% between 
1981-83 and 2003-05, and 
total output value increased 
by an amount equal to that 
seen in Asia after the 1960s 
Green Revolution (see 
http://tinyurl.com/lgkosx). 
Another success is that of smallholder farmers 
who produce hybrid maize in Zimbabwe and 
Kenya and cassava and cotton in West Africa 
(see http://tinyurl.com/mhaebb). Results can 
seem mixed overall because each technology 
must perform within a particular social, eco- 
nomic, institutional and market setting. 
Paarlberg also pays too little attention to 
the substantial efforts that have got under way 
recently in Africa. International donors have, 
for example, lent their support to two important 
strategic initiatives for agriculture: the Alliance 


over policy. 
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“Players on both sides 
of the GM debate have 
fought a fierce tug-of-war 


for a Green Revolution in Africa and the African 
Union’s Comprehensive Africa Agriculture 
Development Programme. Meanwhile, agri- 
cultural biotechnology has received significant 
backing from within Africa itself — including 
a thoughtful assessment from the African 
Union's high-level panel on biotechnology (see 
http://tinyurl.com/Inc46j). GM crop research is 
also under way in countries as diverse as South 
Africa, Burkina Faso and Malawi. 

Paarlberg argues that GM crops are being 
“kept out” of Africa because European lobby 
groups have forced the imposition of “stifling 
regulations” based on “extreme precaution”. His 
roll-call of bad guys is long, 
from Food First, Green- 
peace and the International 
Federation of Organic Agri- 
culture Movements, to the 
United Nations and the Ford 
Foundation. All are blamed 
for preventing a “science-based escape from 
rural poverty”. 

In fact, the pro-GM lobby has been every 
bit as active, bombarding decision-makers 
and media organizations with slick marketing 
materials and free trips to their corporate head- 
quarters. The US government has sponsored 
schemes in Africa that provide biosafety train- 
ing programmes for regulators and that pro- 
mote model legal frameworks. Such projects 
include the US Foreign Agriculture Service's 
scientific exchange programmes and the 
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Agricultural Biotechnology Support Project, 
part of the USAID Collaborative Agriculture 
Biotechnology Initiative. 

Paarlberg’s claim that external, anti-GM 
views have been the main influence on 
decision-making by national governments 
in Africa is not substantiated. Instead, inter- 
national players on both sides 
of the GM debate have fought a 


"Blue-skies 


or health. An informed ‘wait-and-see’ stance 
thus makes sense. 

What of the future? One of the pivotal argu- 
ments in Starved for Science is that promising 
pipeline technologies and longer-term research 
are also being held back. To make his case, Paarl- 
berg cites the effort to develop drought-tolerant 
GM maize, a major programme 
of the African Agricultural Tech- 


fierce tug-of-war over policy, with research into nology Foundation. Supported by 
African regulators and policy- fut cultural the Bill & Melinda Gates Founda- 
makers often left as unwilling : bas seule ural tion, itis working with a range of 
bystanders. techniques is public and private research and 


Meanwhile, on the ground, 
detailed, site-specific evidence 
on the performance of GM technologies indi- 
cates that a farmer’s ability to reap the poten- 
tial benefits depends on a range of technical, 
agronomic and institutional factors (see http:// 
tinyurl.com/ksbfxo and http://tinyurl.com/ 
krmzxu). For instance, the transgenic trait 
needs to be available in crop varieties that can 
perform in constrained environments. A good 
yield depends heavily on favourable soils and 
irrigation, which the poorest farmers typi- 
cally lack. As the experiences of smallholder 
Bt-cotton farmers in South Africa have dem- 
onstrated, GM crop technology also needs to 
be supported by infrastructure and institutions 
ifit is to benefit the poorest people. 

These findings are in contrast to the 
triumphalism of reports that show the spread 
of GM crops around the world, such as that 
released annually by the International Service 
for the Acquisition of Agri-biotech Applica- 
tions. Studied closely, the 2008 report shows 
that only 8 of the 25 countries that grew 
GM crops planted more than a million hec- 
tares. Almost 80% of the global GM crop of 
125 million hectares was grown in just three 
countries: 62.5 million hectares in the United 
States, 21 million hectares in Argentina and 
almost 16 million hectares in Brazil. More- 
over, the GM crops that have been commer- 
cialized to date are mostly insect-resistant Bt 
varieties of maize and cotton and herbicide- 
tolerant varieties of soya bean, designed for 
and mainly used by large-scale commercial 
farmers. 

African agricultural policy-makers have 
some difficult decisions to make. Biotechnology 
will surely be part of the mix of approaches 
required for the future, as indicated both in the 
World Bank’s 2008 World Development Report 
on agriculture and in the 2008 International 
Assessment of Agricultural Knowledge, Sci- 
ence and Technology for Development. But big 
uncertainties remain — including how farmers 
will gain access to markets where GM prod- 
ucts are currently restricted and the potential 
risks of GM technologies to the environment 
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development organizations. 
This broad initiative involves 
conventional breeding, genomics applications 
and genetic-marker-based selection as well as 
genetic modification. Yet Paarlberg zeroes in on 
the GM solution, maintaining that this is where 
the necessary breakthroughs will happen. 
Blue-skies research into future agricultural 
techniques is essential. But inflating expec- 
tations has major downsides. As occurred 
with medical biotechnology, hype can dis- 
tort innovation. It diverts funds from other 
research and narrows the focus of study to 


genetics rather than taking into account the 
wider environmental, behavioural and syner- 
gistic dynamics (P. Nightingale and P. Martin 
Trends Biotechnol. 22, 546-569; 2004). A simi- 
lar process will occur in agricultural science 
unless we retain a balanced perspective of the 
options available. 

A dogmatic and unscientific stance on GM 
crops — whether for or against — helps no one, 
least of all African farmers. A more evidence- 
based approach than Paarlberg’s is needed — 
one that should foster diverse development 
pathways for agriculture underpinned by 
high-quality scientific research and attuned to 
particular circumstances. a 
lan Scoones is a professorial fellow at the Institute 
of Development Studies and co-director of the 
ESRC STEPS Centre, University of Sussex, Brighton 
BN1 ORE, UK. Dominic Glover is a postdoctoral 
researcher in the Technology and Agrarian 
Development Group, Wageningen University, 
6706 KN Wageningen, the Netherlands. 
e-mail: i.scoones@ids.ac.uk 


For additional references, see http://tinyurl.com/ 
Ifp2x3. 


The world ina grain of sand 


Sand: A Journey through Science and the 
Imagination 

by Michael Welland 

Oxford University Press: 2009. 352 pp. 
Blow? 


On an island off the southwestern coast of 
Turkey, called Sedir Adasi, lies a stunning 
stretch of white sand known as Cleopatra’s 
Beach. According to legend, Cleopatra's lover 
Mark Antony made the beach for her as a 
lavish gift by shipping bargeloads of sand 
from Egypt to the island some 2,000 years 
ago. There may be a grain of truth in this story, 
writes Michael Welland in Sand — the “exotic 
creamy white oolith” granules occur nowhere 
else on the island but match those on Egyptian 
coastal beaches west of Alexandria. 

A modern mystery of sand is the origin 
of the silica used in making silicon for the 
electronics industry. After much research, 
Welland drew a blank. The handful of tech- 
nology companies who dominate this crucial 
market would not disclose the source of their 
raw material; all the author could deduce was 
that silicon chips are made from “sand that 
has already been ultrapurified by nature: 
quartzite”. 
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A popular book by a long-time professional 
geologist could easily have been worthy but 
dry. Yet Sand is serious and entertaining; it is 
the work of someone who has been in love with 
the stuff since he built sandcastles as a child. 
Nothing like it has been published before, even 
by the larger-than-life pioneer of sand stud- 
ies, Ralph Bagnold, who serves as Welland’s 
inspiration. 

Bagnold wasa British army officer who saw 
the Sphinx excavated in the 1920s from tonnes 
of preservative sand laid down over millen- 
nia. He became a desert explorer in Egypt 
and a formidable adversary behind enemy 
lines in the Second World War. He was also a 
scientist, and built a wind tunnel at Imperial 
College London to investigate the behaviour 
of sand grains under controlled conditions. In 
1941 he published The Physics of Blown Sand 
and Desert Dunes, a classic study (see Nature 
457, 1084-1085; 2009) that was later used by 
NASA to plan how its rovers should deal with 
Martian sand. 

But Welland stretches beyond Bagnold and 
his deserts. Sand begins on the small scale, 
with the counter-intuitive physics of granular 
behaviour. Strangely, dry sand behaves like a 
liquid, whereas damp sand is more like a solid, 
provided that it is not too wet. Welland explains 
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the stability and collapse of sand piles, 
and the forensic study of sand to solve 
murders. This leads to a discussion 
of the weird and wonderful micro- 
scopic life forms — such as rotifers, 
tardigrades, gastrotrichs, turbellarians 
and others — that have evolved to 
flourish in the spaces between shifting, 
abrasive sand grains. Such an environ- 
ment may become the last refuge for 
life in the event of catastrophic climate 
change. Welland then addresses sand 
on a progressively larger scale: along 
the course of meandering rivers, in the 
formation of beaches, in deserts and 
mid-ocean sandbanks, in the forma- 
tion of Old Red Sandstone rocks found 
extensively from the Arctic to the Gulf 
of Mexico and, finally, in sand found 
beyond Earth. 

Welland asks how sand grains have 
helped humans to conceive the Uni- 
verse and the infinite. He begins with 
Archimedes who, in the third century 
BC, calculated that 10° grains of sand 
would fill the Universe to the outermost 
sphere of the fixed stars. The author also 


Green desert glass from Libya was carved into a scarab beetle for 
Tutankhamun’'s necklace — but how did the silica glass form? 


with a genuine mystery. In 1922, the 
discovery of Tutankhamun’s tomb in 
Egypt yielded a famous necklace with 
a scarab beetle carved from a glowing, 
yellow-green, gem-like material, which 
its discoverer Howard Carter did not 
recognize. In the 1990s, the material 
was shown to be a unique silica glass, 
28 million years old and 98% pure, 
from a particular part of the Libyan 
desert. 

Welland travels to this desolate spot 
and cherishes the glassy samples he 
finds glittering on the dunes. But, he 
muses, what could have produced heat 
that was intense enough to fuse silica? 
A strike from a meteorite or lightning 
can be ruled out because of the lack 
of visible impact craters or hollow 
fulgurite tubes, respectively. He spec- 
ek  ulates that the cause might have been 
an air burst from the impact of an 
asteroid with the atmosphere, similar 
to that at Tunguska in Siberia, Russia, 
in 1908. 

With irresistible ideas such as this, 
Welland provides an appealing blend 


discusses, and attractively illustrates, 

how sand has been used artistically in many 
cultures — from sand painting by Australian 
aboriginal communities and the North Ameri- 
can Navajo, to Zen sand gardens in Japan and 
the sand sculptures created by digital methods 
at the Media Lab at the Massachusetts Institute 


of Technology in Cambridge. One fascinating 
photograph shows the ‘Earthquake Rose; the 
pattern made by a desktop toy, a sand-tracing 
pendulum, during a strong earthquake in 
Washington state in 2001. 

A personal epilogue provides the reader 


of science and the imagination, worthy 
of the famous vision of the poet William Blake: 
“To see a world in a grain of sand”. a 
Andrew Robinson is the author of Earthshock and 
The Story of Measurement. He is a visiting fellow of 
Wolfson College, Cambridge CB3 9BB, UK. 
e-mail: ar471@cam.ac.uk 


D-Day forecast fictionalized 


Turbulence 
by Giles Foden 
Faber and Faber: 2009. 368 pp. £16.99 


Fluid dynamics and weather prediction seem 
unpromising material for a gripping story. 
But Giles Foden has ingeniously dramatized 
what is perhaps the most important weather 
forecast ever made: that for the D-Day land- 
ings, the invasion of continental Europe at 
Normandy by the Allied forces towards the 
end of the Second World War. The result is 
a compelling tale, with meteorologists as the 
unlikely heroes and the turbulence of the title 
providing the central metaphor. 

General Dwight D. Eisenhower, in com- 
mand of the operation, had to be sure that the 
crossing of the English Channel would not 
be disrupted by bad weather. And he needed 
that assurance five days in advance — a length 


of time that stretches today’s forecasting 
techniques to their limit, and which was 
beyond the capability of meteorologists in 
1944. Add to that the need for a low tide to 
evade the German sea defences, the task con- 
fronting the Allies’ weather experts seemed 
insurmountable. 

In Turbulence, Foden tells this story through 
the eyes of a fictional character, Henry 
Meadows, a young academic attached to the 
forecasting team that is led, as it was in reality, 
by the British meteorologist James Stagg. The 
decision-making process of Stagg and his frac- 
tious colleagues, including the brash American 
entrepreneur Irving Krick and the arrogant but 
astute Norwegian Sverre Petterssen, occupies 
the last third of the book. Stagg and Petterssen 
each published their own accounts in the 
1970s. Although Foden’s tale is steeped in 
that history, he allows Meadows to make the 
crucial, unrecognized contribution. 
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The story begins with Meadows being sent to 
rural Scotland to glean clues about forecasting 
from the leading authority of the day, the 
difficult genius Wallace Ryman. Ryman is a 
fictionalized version of Lewis Fry Richardson, 
known for his work on fractal coastlines, who 
Foden rightly calls “one of the unsung heroes 
of British science”. Like Richardson, Ryman is 
a Quaker whose experiences in the Friends’ 
Ambulance Unit during the First World War 
convinced him that war must be avoided. 
He shuns collaboration with the military, so 
Meadows must pursue his mission by stealth 
—an attempt that he mostly bungles. 

In Scotland, Meadows runs into the second 
wayward genius in the book, this time without 
a pseudonymous disguise: Geoffrey Pyke, the 
man behind the Habbakuk project to build air- 
craft carriers out of ice reinforced with wood 
pulp. This ‘Pykrete’ is extraordinarily resistant 
to impacts and melting. Also making a fleeting 
appearance is Julius Brecher, a doppelganger 
for biochemist Max Perutz, who assisted Pyke 
during the war. This part of the plot would seem 
far-fetched if you didn’t know that it is true. 
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Yet the Habbakuk story, however 
entertaining, seems tacked on. It is not 
central to the plot even though it supplies 
a framing device: Meadows recounts his 
wartime exploits while on board an ice 
ship built in 1980 for an Arab sheikh. 
When Meadows joins Pyke in wartime 
London only to see the Habbakuk project 
terminated, it feels like a cul-de-sac. 
One could carp at a few other points of 
creaky plotting or narrative, but that 
seems churlish given how splendidly the 


Secret wartime plans included building giant ships from ice. 


But Foden has a motive in shaping 
Meadows this way, using him to capture 
a sense of the dour, buttoned-up character 
of wartime Britain. And unlike so many 
fictional scientists, Meadows is believ- 
able: his discourses on turbulence and 
hydrodynamics are assured, even uncom- 
promising, without the breezy ‘beginner’s 
guide’ flavour that is the usual hallmark of 
undigested authorial research. Here Foden 
had the immense benefit of advice from 
his father-in-law, Julian Hunt, a leading 


book animates a buried story of scientific 
endeavour and triumph. 

One must also ask whether the author 
succeeds in creating scientists who are fully 
fleshed individuals. Foden complicates his task 
by making Meadows callow and withdrawn 
as a result of a childhood trauma in Africa. 


Meadows'’s fixation on his research and his 
social awkwardness could make him a cari- 
cature of the diffident scientist. Brecher also 
refracts everything through the prism of his 
own research topic of blood, while Ryman is 
the crabby boffin and Pyke the dotty one. 


expert on turbulence and meteorology and, 
fittingly, a recipient of the Lewis Fry Richard- 
son medal for nonlinear geophysics. Skilfully 
balancing fact and fiction, Turbulence is 
dramatic, intelligent and convincing. a 
Philip Ball is a freelance writer. His most recent 
books form a trilogy called Nature's Patterns. 


Bling of the Bactrians 


Afghanistan: Hidden Treasures from the 
National Museum, Kabul 

Metropolitan Museum of Art, New York 
Until 20 September 


Desire for gold has driven people out of their 
homes and out of their minds. So Soviet 
archaeologist Viktor Sarianidi cannot have 
been surprised when a crowd gathered to 
gawk at a mound he was excavating in north- 
ern Afghanistan in 1978. Tillya Tepe, the Hill 
of Gold, dates to the first century aD, when 
the land was known as Bactria, and contained 
the graves of six nomads — a chieftain and 
five women — buried with more than 20,000 
golden and bejewelled belongings, 
some of which are now on show at 
the Metropolitan Museum of Art 
in New York. 

Sarianidi sent the treasures to 
the National Museum in Kabul and 
returned to Moscow. Then came 
the wars. The Soviet army invaded 
Afghanistan in 1979, the National 
Museum was nearly destroyed by 
shelling in 1994, and in 2001 it 
was ransacked by the Taliban. Yet 
museum guards had hidden the 
treasures in secret vaults in the 
presidential palace and kept their 
location secret for some 25 years. 

Afghanistan portrays the tre- 
mendous challenges of preserving a 
country’s heritage in the face of war. 
“All of these artefacts were supposed 


and archaeologist Fredrik Hiebert, who was 
invited by the Afghans in 2003 to catalogue 
the crates when they were rediscovered. “Every 
time we opened a box, it was like a miracle.” 
Along with the Tillya Tepe gold, the guards 
had concealed nearly all of the treasures from 
Kabul’s museum. 

The exhibition displays artefacts from 
four archaeological sites, each focusing on a 
major stage or civilization in Afghanistan’s 
history. They reveal the multiple influences 
of the Roman, Indian, Greek and Chinese 
cultures that infiltrated the ancient nation. A 
second-millennium gold bowl unearthed at 
the Bronze Age settlement of Tepe Fullol in 
the northeast of the country is decorated with 


bearded bulls, a motif from distant Mesopota- 
mia. The third-century-Bc Greek-style city of 
Ai Khanum, founded after Bactria’s conquest 
by Alexander the Great, yielded two sophis- 
ticated sundials: one, carved from limestone 
in the form ofa throne balanced on two lions’ 
legs, was designed for Ai Khanum’s latitude; 
the other, a unique cylindrical design, was 
calibrated for ancient Syene (Aswan) in Egypt. 
Another mechanical marvel comes from 
Begram, a city on the Silk Road that thrived 
in the first and second centuries ap. A green 
bronze basin is filled with sinuous metal fish, 
their moveable fins and tails wired to small 
weights that would make them ‘swim’ when 
the bow was filled with water. 

But the gold of Tillya Tepe is the most allur- 
ing. Its pastoralist owners ploughed their profits 
from sheep- and goat-herding into shimmer- 
ing trophies ornamented with sym- 
bols of diverse cultures: Aphrodite 
with an Indian dot on her fore- 
head, a dagger handle topped with 
a Siberian bear, Chinese-style boot 
buckles, Roman coins, resplend- 
ent gold jewellery and a folding 
gold crown composed of five ‘trees’ 
adorned with rosettes and birds. 
That such valuables survived “is an 
amazing testament to the Afghans’ 
ability to keep a secret’; Hiebert says. 
“The Communists came through, 
the mujahedin came through, the 
Taliban came through, and these 
poor underpaid museum people 
didn’t tell them, ever. I want to tell 
that story to my children” a 
Josie Glausiusz is a writer based in 
New York. 


to have been lost,’ says curator Museumstaff hid this folding gold crown from Afghanistan's looters for years. e-mail: jg@planetjosie.net 
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A tight squeeze 


Henry Cohn 


How can identical particles be crammed together as densely as possible? A combination of theory and 
computer simulations shows how the answer to this intricate problem depends on the shape of the particles. 


We all know from experience with luggage just 
how difficult it is to pack objects efficiently 
into a limited space. These difficulties are even 
greater when huge numbers of objects, such 
as grains of wheat, are involved. From Luke 
the evangelist’s reference to “a good measure, 
pressed down, shaken together and running 
over” all the way to modern disclaimers that 
contents may have settled during shipping, 
nobody has been able to analyse how to achieve 
the densest possible packings. On page 876 of 
this issue, Torquato and Jiao describe! remark- 
able computer-simulation results that show 
how subtle this problem can be, while offering 
new hope for understanding important cases. 

Identical, perfect spheres are among the 
simplest objects to pack. It is not difficult to 
guess how — just look at the way cannonballs 
are stacked at war memorials. But theoretical 
analysis of the problem presents profound dif- 
ficulties that were overcome only recently by 
Thomas Hales’, nearly four centuries after the 
answer was conjectured by Johannes Kepler’. 
Of course, most granular materials do not con- 
sist of spherical grains, and this complicates 
matters tremendously. For most grain shapes 
we cannot guess or even closely approximate 
the answer, let alone prove it, and it is difficult 
to develop even a qualitative understanding of 
the effects of grain shape on packing density. 

Instead of perfect spheres, Torquato and 
Jiao study’ packings of the five Platonic solids: 
the tetrahedron, cube, octahedron, dodecahe- 
dron and icosahedron (see Fig. 1 on page 877). 
These are the simplest and most symmetrical 
polyhedra. Needless to say, nobody expects the 
grains in physical materials to have these pre- 
cise shapes, but they are a beautiful test case for 
understanding the effects of corners and edges 
and the role of symmetry. 

The cube-packing problem is easy — cubes 
can fill space completely. But the densest 
packings of the other Platonic solids are much 
less obvious. They do not tile space — fill 
space with no gaps or overlaps — despite Aris- 
totle’s incorrect assertion’ that tetrahedra do. 
In their simulations, Torquato and Jiao find 
a striking difference between two cases: the 
octahedron, dodecahedron and icosahedron 
have central symmetry (that is, each remains 


Figure 1| Two-dimensional view of packings. a, In a two-dimensional Bravais lattice packing, there is one 
particle per lattice cell, and the particles are all aligned with each other. In this illustration, the particle 

is represented by a triangle and is located in the upper left corner of the cell. b, In a general, non-Bravais 
lattice packing, there are multiple particles per cell, and they can be arbitrarily rotated. In their computer 
simulations, Torquato and Jiao’ show that the best packings of centrally symmetric Platonic solids turn 
out to be Bravais lattice packings, which is remarkable given how restricted such packings are. 


unchanged by a 180° rotation about a point at 
its centre). But the tetrahedron does not have 
central symmetry, and this turns out to be the 
crucial distinction. 

In the centrally symmetric cases, Torquato 
and Jiao show that the highest-density pack- 
ings belong to the simplest kind, called Bravais 
lattice packings (Fig. 1a), although this con- 
straint is never directly imposed on their 
simulations. In such arrangements, all the par- 
ticles are perfectly aligned with each other, and 
the packing is made up of lattice cells that each 
contain only one particle. The densest Bravais 
lattice packings had been determined previ- 
ously *°, but it had seemed implausible that they 
were truly the densest packings, as Torquato 
and Jiao’s simulations and theoretical analysis 
now suggest. By contrast, for the tetrahedron it 
has long been known that Bravais lattice pack- 
ings are far from optimal, and in this case the 
authors achieve a record density: they find a 
non-Bravais lattice configuration (Fig. 1b) 
that fills up 78.20% of the space available 
(an improvement over the previous record’ of 
77.86%, or 36.73% for Bravais lattices*). 

To find their packings, Torquato and Jiao 
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use a powerful simulation technique. Starting 
with an initial guess at a dense packing, they 
gradually modify it in an attempt to increase 
its density. In addition to trying to rotate or 
move individual particles, they also perform 
random collective particle motions by means 
of deformation and compression or expansion 
of the lattice’s fundamental cell. With time, the 
simulation becomes increasingly biased 
towards compression rather than expansion. 
Allowing the possibility of expansion means 
that the particles are initially given consider- 
able freedom to explore different possible 
arrangements, but are eventually squeezed 
together into a dense packing. 

The new tetrahedron packing is a variant of 
an ingenious construction found by Chen last 
year’. Using physical models, she observed that 
tetrahedra pack well when arranged in a form 
Torquato and Jiao call ‘wagon wheels’: wheels of 
five tetrahedra sharing an edge, with the wheels 
joined in pairs at right angles (see Fig. 3a on 
page 878). How best to arrange these pairs of 
wagon wheels is not clear, but Chen used a 
computer algebra system (a software program 
that manipulates mathematical formulae) 
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to optimize their placement and achieved a 
density of 77.86%, which is a vast improve- 
ment over the previous record’ of 71.75%. 
The authors’ simulations’ suggest that Chen’s 
solution was nearly, but not quite, optimal. 
Although Torquato and Jiao’s improvement 
on Chen’s packing is noteworthy, perhaps the 
most intriguing implication of their work is the 
apparent optimality of Bravais lattice packings 
for the centrally symmetric Platonic solids (as 
well as generalizations such as Archimedean 
polyhedra). This part of the work may seem 
less exciting, because the densest packings 
turned out to be known already. However, ina 
field with few clear organizing principles, this 
latest insight into the part played by symmetry 
might take on the role of a twenty-first-century 
Kepler conjecture. It will surely inspire 
many future research papers, and with any 


luck we won't have to wait 400 years for a full 
understanding of it. a 
Henry Cohn is at Microsoft Research New 
England, One Memorial Drive, Cambridge, 
Massachusetts 02142, USA. 

e-mail: cohn@microsoft.com 
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STEM CELLS 


Escaping fates with open states 


Robert J. Sims II and Danny Reinberg 


The ability of embryonic stem cells to give rise to any cell type relies on 
a remodelling protein that maintains open chromatin. But the chromatin 
landscape of these cells may be more complex than previously thought. 


Embryonic stem (ES) cells are ideal models for 
studying the molecular principles that dictate 
cell fate. ES cells can self-renew and form every 
type of cell in an organism — a property called 
pluripotency. It is well established that DNA- 
binding proteins set the stage for determining 
cell-type specificity by orchestrating complex 
patterns of gene expression’. However, DNA 
is tightly assembled with accessory proteins 
into a structure called chromatin, which 
limits its accessibility to regulatory proteins. 
Chromatin can be loosely packed, or ‘open, 
allowing DNA-binding proteins ready access 
to the genome. Alternatively, it can be densely 
compacted, or ‘closed, making the DNA rela- 
tively inaccessible’. On page 863 of this issue, 
Gaspar-Maia et al.’ provide evidence that 
chromatin structure is intimately connected 
with cell fate by showing that highly accessible 
chromatin is essential for the unique properties 
of stem cells. 

ES cells have to achieve a delicate balance 
of gene regulation: they must suppress genes 
that result in premature differentiation, while 
maintaining expression of genes that allow 
self-renewal. Previous work*” has shown that 
ES cells maintain an open chromatin structure 
and express a large proportion of their genes. 
Differentiation of ES cells into mature cell 
types, such as neural cells, is accompanied by 
closing of chromatin’ and by widespread gene 
silencing®. As chromatin structure strongly 
dictates whether or not genes can be expressed, 
Gaspar-Maia and colleagues* examined the 
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role of enzymes that can modify this structure 
in keeping ES cells pluripotent. 

The authors found that decreasing the 
amounts of a chromatin-remodelling protein 
called Chd1 impaired ES-cell proliferation 
and reduced expression of the DNA-binding 
transcription factor Oct4, which is a master 
regulator of ES-cell function. Chd1 has previ- 
ously been shown to facilitate gene expression’, 
consistent with its role in Oct4 regulation. 
However, in this case’ its depletion did not 
result in a global decrease in gene activity, as 
would be expected for a factor widely associ- 
ated with active gene transcription. Instead, ES 
cells lacking Chd1 upregulated the expression 
of genes involved in nervous-system develop- 
ment, and they tended to differentiate into cells 
of the neuronal lineage’. 

Whether chromatin structure is accessible 
or compact is determined in part by chemical 
modification of its core protein components, 
histones. For example, trimethylation oflysine 4 
on histone H3 (H3K4me3) is a marker of 
open chromatin. Chd1 contains a motif that 
recognizes H3K4me3 (refs 7, 8) and indeed, 
when Gaspar- Maia and colleagues performed 
genome-wide studies of Chd1 localization 
on chromatin in normal ES cells, they found 
that Chd1 co-localizes with H3K4me3. When 
the authors depleted ES cells of Chd1, they 
observed an increase in the proportion of con- 
densed chromatin in these cells. On the basis 
of their results, they surmise that Chd1 main- 
tains an open chromatin structure in ES cells, 
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which is required for their pluripotency. 

The interplay between Chd1, H3K4me3 
and open chromatin (Fig. 1) highlighted by 
Gaspar-Maia and colleagues’ is reinforced 
by findings from other studies” *, H3K4me3 
is incorporated into chromatin during active 
transcription’; therefore, promotion of gene 
expression by Chd1 might ensure that chro- 
matin is rich in H3K4me3 and maintained 
in an open conformation. Chd1 has also 
been shown" to regulate DNA-replication- 
independent deposition of chromatin enriched 
in H3K4me3. Moreover, binding of Chd1 to 
chromatin may protect H3K4me3 from turn- 
over through histone demethylation. This 
histone mark also prevents the binding of 
factors that mediate gene silencing, such as the 
NuRD histone-modifying complex'!”” and the 
DNA methyltransferase subunit DNMT3L”. 
Loss of H3K4me3 would therefore allow 
increased chromatin compaction induced 
by NuRD and DNA methylation, highlight- 
ing the need to maintain a correct balance of 
H3K4me3. In addition, the histone methyl- 
transferase Suv39H_1 facilitates the formation 
of silent chromatin domains by methylating 
lysine 9 on histone H3. Suv39H1 is known 
to act on histones that lack methylated H3K4 
(Fig. 1); thus, open chromatin would serve to 
counteract this repressive activity. 

What Gaspar-Maia and colleagues’ work* 
does not explain is how, despite depletion 
of Chd1, global ES-cell gene activation is 
largely unaffected while genes that drive dif- 
ferentiation towards the neuronal cell fate 
are upregulated. It is possible, though, to rec- 
oncile these findings by taking into account 
Chd1-mediated regulation of the distribution 
of H3K4me3. For instance, the p400/TIP60 
complex, which turns genes on or off by chang- 
ing the composition of chromatin, also seems 
to bind H3K4me3 (ref. 14). When the amount 
of H3K4me3 is reduced in ES cells, bind- 
ing of p400/TIP60 to its chromatin targets 
is impaired”. In this way, the perturbation 
of H3K4me3 in the absence of Chd1 could 
impair gene silencing through its effects on 
factors such as p400/TIP60. Although unusual 
for somatic cells (non-gametes), in ES cells, 
H3K4me3 may be associated with silencing 
of genes at focal locations throughout 
chromatin”. Alternatively, widespread gene 
reactivation could be a consequence of indirect 
effects of the downregulation of ES-cell master 
regulatory factors such as Oct4. 

Studying differentiated cells undergoing 
reprogramming to a pluripotent state’® allows 
insight into the core properties of stem-cell 
chromatin. Cells that fail reprogramming reac- 
tivate subsets of stem-cell-related genes, but 
simultaneously fail to repress lineage-specific 
transcription factors’®. Thus, the correct mix of 
gene expression to achieve stem-cell properties 
may depend more on local alterations in chro- 
matin structure than on global chromatin reor- 
ganization. Gaspar-Maia et al.’ show that Chd1 
loss prevents the reprogramming of somatic 
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Figure 1| Chd1 and open chromatin. The chromatin-remodelling protein Chd1 may maintain open 
chromatin in various ways in ES cells. a, Chd1 facilitates active transcription, which induces the 
placement of H3K4me3 (yellow), a histone mark conducive to open chromatin structure. The 
p400/TIP60 complex supports ES-cell function by binding to H3K4me3 and regulating gene 
expression. b, Chd1 may promote deposition of H3K4me3 in a DNA-replication-independent 
manner, maintaining open chromatin. c, H3K4me3 prevents binding of factors that promote closed 
chromatin and that mediate gene silencing, such as the NuRD histone-modifying complex and the 
DNA methyltransferase subunit DNMT3L. Chd1 binding may also protect H3K4me3 from turnover 
by histone demethylases (HDMs). d, Gaspar-Maia et al.’ show that, on removal of Chd1, chromatin 
in ES cells is more compacted. This may be due to loss of H3K4me3, allowing binding of NuRD 

and DNMTs. Also, the histone methyltransferase Suv39H1 facilitates formation of silent chromatin 
by methylating lysine 9 on histone H3 (red). Suv39H1 preferentially methylates histones that lack 
H3K4me. Compaction of chromatin is associated with ES-cell differentiation, possibly due to the 
silencing of genes that are essential for maintaining pluripotency. 


cells to a pluripotent state. A small imbalance 
in chromatin structure, induced by loss of 
Chd1, might reorganize boundaries between 
open and closed chromatin states; this could 
result in failure to maintain the appropriate 


mix of specific gene expression and gene 
silencing, sending the ES cell down a path of 
differentiation. 

Could the increased amount of closed chro- 
matin observed in ES cells lacking Chd1 (ref. 3) 


stem from focal changes in chromatin at genes 
encoding cell-lineage-specifying factors, rather 
than representing a genome-wide alteration 
in chromatin structure? Current assays do 
not accurately measure the physical proper- 
ties of chromatin structure in cells, and novel 
techniques to analyse higher-order chromatin 
structure in vivo are needed to answer such 
complex questions. It is clear, however, that 
the careful maintenance of an open chromatin 
structure, whether global or local, by factors 
such as Chd1, is crucial for the preservation of 
pluripotency. a. 
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Elementary factors 


Josep Pefuelas and Jordi Sardans 


The identification of a general connection between biogeochemistry and 
the structure of food webs would constitute a considerable advance in 
understanding ecosystems. Ecologists are on the case. 


Organisms are the products of chemical 
reactions, and their growth depends on the 
availability of various elements, especially 
carbon, nitrogen and phosphorus. The 
growth-rate hypothesis’ links the relative ele- 
ment content of organisms to their growth 
rate, the idea being that fast-growing organ- 
isms need relatively more P-rich RNA, the 
main component of the protein-producing 
ribosome, to support rapid protein synthe- 
sis. Consequently, ecosystem conditions that 
produce organic matter with low C:P and 
N:P ratios would be expected to result in 
lower protein:RNA ratios, higher adaptive 
growth rates, more efficient energy transfer 
through a food web, and increased biomass of 


large-bodied animals relative to that of small- 
bodied animals’. In Global Change Biology, 
Mulder and Elser’ offer a first test of this associ- 
ation between C:N:P stoichiometry and faunal 
size spectrum for soil food webs. 

Elser et al.’ had previously proposed the 
growth-rate hypothesis (GRH) to explain dif- 
ferences in C:N:P ratios among zooplankton 
species; support for the hypothesis in that 
context soon appeared’. Subsequent inves- 
tigations”* revealed significant relationships 
between body C:N:P stoichiometry and 
growth rate in diverse groups of organisms. 
The scaling up of studies to the community 
level related the elemental stoichiometries of 
water environments and of body composition 
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to the body-size spectrum of food webs in 
aquatic ecosystems’. But investigations of such 
relationships in terrestrial ecosystems are only 
now beginning*. 

Mulder and Elser’ analysed the mass- 
abundance and body-size spectra of all soil 
invertebrates, fungi and bacteria in 22 grass- 
land soils in the Netherlands. They find that 
the higher the availability of P in the soil (the 
lower the C:P ratio), the steeper is the slope 
of the faunal biomass size spectrum — that 
is, the biomass of large-bodied invertebrates 
becomes greater relative to that of small 
invertebrates. This relationship has broad 
implications because soil invertebrates of 
different sizes have different effects on soil 
processes. The relationship connects biogeo- 
chemistry with the structure of food webs, 
and — if it holds true in different settings — 
opens a way to developing more general laws of 
ecosystem structure and dynamics. 

So Mulder and Elser’s results” are promis- 
ing. Nonetheless, the association between 
soil-element stoichiometry and the body-size 
spectrum of the food web might not simply 
result from faster growth of different organ- 
isms in the food web in response to low C:P 
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and protein:RNA ratios, as predicted by the 
GRH. As Mulder and Elser point out, other 
factors may be involved. For example, the shift 
in body-size spectrum might have resulted 
from the change from one main food resource 
(bacterial cells) to another (fungal remains) 
that Mulder and Elser found in their soils as 
C:P ratios decreased. The shift between these 
two different energy channels’, which differ 
in both productivity and turnover rate, could 
have driven the shift in invertebrate domi- 
nance from bacterium-feeding nematodes to 
microarthropods, which have different feeding 
habits and sizes. The link between C:P ratio 
and body-size spectrum could result from 
the microfauna (nematodes) coping better 
with P-limited conditions than the mesofauna 
(microarthropods). 

Tests of the GRH are being extended to 
plants, which present a much more variable 
stoichiometry than animals or microbes”. In 
an investigation in pines just published in Ecol- 
ogy Letters, Matzek and Vitousek”’ find no link 
between foliar N:P ratio and growth rate. In 
comparing leaves of the same species growing 
at different rates as a result of different nutrient 
conditions, they found that the faster-growing 
plants had lower protein:RNA ratios but not 
consistently lower N:P ratios; and in compar- 
ing several species with different growth rates, 
they found that there was no relation between 
growth rate and foliar N:P or protein:RNA 
ratios. Their results thus deviate from the 
predictions of the GRH. 

Matzek and Vitousek” conclude that plants 
adjust the balance of protein and RNA to favour 
either speed or efficiency of protein synthesis, 
depending on whether nutrient availability is 
high or low. According to this thinking, the 
balance of protein and RNA is not dictated only 
by the stoichiometry of elements in the leaf, nor 
does this balance, in isolation, dictate leaf N:P 
stoichiometry. In fact, the stoichiometry of 
plants might be decoupled from metabolic 
and physiological needs by storage of nutri- 
ents in vacuoles, obscuring the stoichiometric 
differences that would be expected given 
the protein:RNA ratios observed. Thus, the 
links between N:P stoichiometry and growth 
capacity in plants would seem not to be ruled 
by protein:RNA ratio alone. 

Further assessment of the GRH evidently 
requires many more studies of the effects of 
C:N:P ratios on proteins and RNA, as well as 
on other different metabolic products, and on 
growth rates and body sizes in different taxa 
and ecosystems. The number of variables is 
bewildering, and, as is clear from these two 
papers””', the answers will not be universally 
straightforward. 

Metabolomics analysis” is emerging as one 
way forward, however. Improved analytical 
methods and computerized interpretation 
of large data sets have transformed the task 
of quantifying more metabolic compounds 
— carbohydrates, amino acids and peptides, 
lipids, phenolics, terpenoids, alkaloids and 
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so on — and associating them with the 
corresponding elemental stoichiometries. 
Metabolomics should thus help to interpret 
the response of different groups of organisms 
in allocating resources to growth, storage and 
defence. It may also provide the elemental and 
metabolic budgets for different species along 
gradients from low to fast growth, which would 
allow a better test of the links between C:N:P 
ratio, growth rate and body-size spectrum. 

This is an endeavour that goes well beyond 
assessment of the GRH. Such research will 
improve our understanding of the coupling 
between different levels of biological organi- 
zation, from cellular metabolism to ecosys- 
tem structure and nutrient cycling. In doing 
so, it will help in integrating analysis of the 
pools and flows of chemical elements and 
energy studied in ecosystem ecology, with 
assessments of genetic fitness and the bio- 
chemical products of genomes considered in 
evolutionary ecology. o 
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CANCER 


More than kin and less than kind 


Kevin Shannon and Mignon Loh 


A gene that is found to be mutated in a type of blood cancer exhibits 
properties of both a growth-suppressing tumour suppressor and a 


growth-promoting oncogene. 


Myeloproliferative neoplasms (MPNs) are a 
group of cancers characterized by excessive 
production of one or more types of differenti- 
ated blood cell. Mutations in genes that encode 
components of cellular signalling networks are 
a hallmark of MPNs, and cells from affected 
patients are often abnormally sensitive to 
growth factors called cytokines’. On page 904 
of this issue, Sanada et al.” report that muta- 
tions in the CBL gene, which are found in a 
subset of patients with MPNs, contribute to 
the development of cancer in unexpected and 
complex ways, suggesting that the normal 
protein encoded by this gene both promotes 
and inhibits the growth of blood cells. 

Sanada and colleagues’ performed genome- 
wide studies of bone-marrow samples from 
patients with MPNs to uncover acquired 
genetic changes that might contribute to abnor- 
mal cellular growth. The authors were particu- 
larly interested in detecting a type of genetic 
alteration called acquired uniparental disomy 
(aUPD) that occurs in somatic cells (non- 
gametes). In this alteration, a region of DNA 
on one chromosome is lost in tumour cells, 
but is replaced by a copy of the correspond- 
ing segment from the partner chromosome 
that harbours a mutant gene. This mecha- 
nism can contribute to leukaemia through 
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loss of the normal copy (allele) of a tumour- 
suppressor gene’ or by duplicating genes that 
promote cancer (oncogenes)*”. Because this 
alteration results in replacement of the deleted 
DNA, the region of aUPD eludes discovery by 
conventional techniques. To surmount these 
difficulties, the authors” and other groups®* 
have analysed samples using single nucleotide 
polymorphism (SNP) arrays — a technique 
that has become a valuable tool in cancer 
biology. One of the many advantages of SNP 
analysis is that it can identify aUPD by detect- 
ing a chromosomal region in which the DNA 
is derived from a single parental chromosome 
(so-called loss of heterozygosity). 

Using this technique, researchers have 
detected aUPD spanning a region of chromo- 
some 11 that includes the CBL gene in MPN 
specimens”**, and have gone on to discover 
mutations in this gene. The Cbl protein is 
expressed in various cell types and regulates 
cellular signalling networks by acting as a 
multifunctional adaptor molecule and as an 
E3 ubiquitin ligase’, an enzyme that attaches a 
ubiquitin molecule to growth-factor receptors 
and other cellular proteins. Ubiquitination of 
growth-factor receptors triggers their inter- 
nalization and degradation, thereby reducing 
the signalling cascades that promote cellular 
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Protein picker 


Reporting in Nature Chemical Biology, 
Ngo et al. describe a system for 
cell-selective protein labelling in 
mixtures of cells (J. T. Ngo et al. 
Nature Chem. Biol. doi:10.1038/ 
nchembio.200; 2009). Their 

work is based on the principle 

that proteins can be tracked in 
experiments if they are engineered 
so that some of their constituent 
amino acids contain a tag, such 

as aradioactive label; tagged 

amino acids can be added to cells 

in culture, whereupon the cells 
incorporate them into newly formed 
proteins. The general problem with 
this approach is that all the cells 

in the culture become labelled, 
whereas it might be that only certain 
types of cell need to be tagged. 

Ngo and colleagues’ solution to 
that problem involves non-naturally 
occurring amino acids that contain 
azide (N3) groups in their side 
chains. Azides don't react with 
biological molecules, but under 
certain conditions they do react 
quickly with synthetic molecules 


that contain alkyne groups (which 
have carbon-carbon triple bonds). 
Once incorporated into proteins, 
azide-containing amino-acid 
residues will thus react with 
alkyne-containing fluorescent dyes 
or affinity reagents, so tagging the 
proteins for imaging, detection 
or separation. 
Non-natural amino acids are 
already used for protein labelling, 
but Ngo et al. take the idea further. 
They exploit an azide-containing 
amino acid that is incorporated 
into proteins only by cells that 
express a mutant of the methionyl- 
tRNA synthetase enzyme (which 
is involved in protein synthesis). 
When the authors added this 
amino acid to a co-culture of 
normal and mutant Escherichia 
coli cells, only the mutants were 
subsequently tagged with an 
alkyne-containing affinity reagent 
or fluorescent dye. 
To show that their technique could 
be applied to mixtures of bacterial 
and mammalian cells, Ngo et al. 


infected mouse macrophages with 
E. coli cells that express the mutant 
enzyme, in the presence of the 
azide-containing amino acid. They 
then treated the macrophages with 
a green alkyne-containing dye, 
which tagged only the bacterial 
cells (pictured; macrophages are 
stained orange, and are about 

15 micrometres in diameter). 

The authors were also able to 
specifically tag newly synthesized 
bacterial proteins with an affinity 
reagent, which was then used as 


a handle to isolate those proteins 
from the culture. 

The authors’ approach allows 
the cellular origins of proteins in 
complex multicellular systems to 
be determined. It could therefore 
be handy for isolating proteins from 
pathogens in studies of infections, 
for example, or for identifying 
the complement of proteins of a 
single bacterial species living ina 
community of many other microbial 
organisms. 
Andrew Mitchinson 


proliferation. The missense CBL mutations 
found in MPNs introduce amino-acid substi- 
tutions that disable ubiquitin-ligase activity. 
Consistent with a previous report”, Sanada 
et al.* show that mice lacking the Cbl gene pro- 
duce increased numbers of immature blood 
cells. They also find that Cbl inactivation 
promotes the development of leukaemia in 
mice engineered to express the pro-leukaemic 
BCR-ABL gene. Mutant Cbl proteins inhibit 
ubiquitination of growth-factor receptors 
in blood-cell lines, even in cells that retain a 
normal copy of the CBL gene””, and the authors 
demonstrate that this inhibition is associated 
with prolonged receptor activation and an 
enhanced proliferative response to cytokine 
growth factors. Although these studies pro- 
vide evidence that CBL is a tumour-suppressor 
gene, Sanada and colleagues’ data’ also suggest 
amore complex role for CBL in leukaemogen- 
esis. For example, if mutation of a single CBL 
allele is sufficient to disrupt normal ubiquitin- 
ligase activity, it is unclear why MPN cells also 
inactivate the normal copy of the gene through 
aUPD. The authors’ findings that the effects of 
the mutant Cbl proteins are more pronounced 
in cells that lack a normal CBL gene suggest 
that other biochemical properties contribute 
to tumour growth. This idea is consistent with 
the observation that mice lacking Cbl do not 
spontaneously develop MPNs, as would be 
expected if the gene acted purely to suppress 


tumour formation. Finally, when the authors 
over-expressed mutant CBL in fibroblasts, these 
cells showed cancerous properties’. 

So what are we to make of CBL? Is it a tumour 
suppressor or an oncogene, or, as Shakespeare 
might have put it, “more than kin and less 
than kind”? On the one hand, there is strong 
selective pressure to delete the normal CBL 
allele in tumour cells, resulting in loss of ligase 
activity that restrains the output of activated 
growth-factor receptors. These are impecca- 
ble credentials for a tumour-suppressor pro- 
tein. However, the blood cells of patients with 
MPNs invariably retain at least one gene that 
encodes a functional, albeit mutant, Cbl pro- 
tein. Furthermore, mutated Cbl proteins seem 
to acquire unexpected growth-promoting 
functions” — this gain-of-function charac- 
teristic is not seen in blood stem cells that 
lack CBL. Such features implicate CBL as a 
bona fide oncogene. 

Tumour suppressor or oncogene? Perhaps 
a solution to this conundrum is that this 
multi-domain protein fine-tunes the growth 
of blood stem cells and progenitor cells by 
simultaneously promoting and restraining 
growth through distinct protein-protein 
interactions. Recent studies offer intriguing 
clues about the potential growth-promoting 
biochemical properties of mutant Cbl proteins. 
First, Sanada et al.’ find that expression of the 
mutant Cbl proteins is associated with aberrant 
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phosphorylation of STAT5, an activator of 
gene transcription. Aberrant phosphoryla- 
tion is a biochemical feature of some types of 
MPN", and although in this report’ it may be 
due to loss of Cbl-mediated ubiquitination of 
cytokine receptors that activate STATS, other 
mechanisms are possible. Second, CBL muta- 
tions and mutations in the oncogene NRAS 
were mutually exclusive in the adult MPN 
patients studied by the authors. CBL muta- 
tions were also found only in specimens from 
children with juvenile myelomonocytic leu- 
kaemia (a type of MPN) without mutations 
in NRAS or KRAS*. As this childhood leukae- 
mia is an MPN in which a hyperactive form of 
the cell-signalling molecule Ras has a central 
role, there is likely to be a connection between 
mutant Cbl proteins and Ras signalling. Data 
suggesting that Cbl regulates Ras trafficking in 
cells are intriguing in this respect’”. The elegant 
functional studies of Sanada et al.’ thus raise 
fascinating questions about the nature of onco- 
genic Cbl-mediated interactions in MPNs and 
how such interactions might be targeted to treat 
these disorders. | 
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MICROBIAL GENETICS 


Love the one you're with 


Joseph Heitman 


Candida albicans is notorious as an opportunistic microbe that causes 
thrush and serious systemic disease. For geneticists, however, it offers 
continuing revelations into the wondrously varied sex lives of fungi. 


Sex facilitates evolution by recombining 
parental genomes and purging deleterious 
mutations. There are myriad forms, but they 
have three steps in common: meiosis, which 
halves the number of chromosomes per cell; 
gamete production; and cell-cell fusion. Sex in 
most animals involves two individuals of oppo- 
site sex, who produce distinct gametes (sperm 
and oocytes). In fungi, sex typically involves 
morphologically similar cells of opposite mat- 
ing type (a and a) that reciprocally produce 
sex-stimulating pheromones. In other cases, 
fungi are self-fertile, and lone individuals can 
reproduce sexually. One well-known exam- 
ple of this is the model yeast Saccharomyces 
cerevisiae, in which a single cell divides, one of 


a Opposite-sex mating in Candida albicans 


the resulting cells switches mating type, and 
mating yields a/a diploid cells, with a double 
complement of chromosomes, that no longer 
mate but undergo meiosis. On page 890 of this 
issue, Alby et al.' report a conceptually similar 
but mechanistically distinct self-mating phe- 
nomenon in another yeast, the pathogenic 
Candida albicans. 

Candida albicans resides in the human micro- 
biota, for example in the gastrointestinal tract, 
and commonly causes infections. For more than 
a century it was thought to be strictly asexual. 
Then the mating-type gene locus was discov- 
ered, and most isolates were found to be a/a dip- 
loids that, like S. cerevisiae, are sterile’. Mutants 
with only one mating type (a/a or a/a) were 
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Figure 1| Opposite-sex and same-sex mating in Candida albicans. This fungus is diploid (2n): most 
strains are heterozygous, having both mating-type gene forms (a/a), and are sterile. a, Opposite-sex 
mating involves loss of heterozygosity, yielding a and a diploid cells that then switch from the white 
type to the opaque type specialized for mating. Cell-cell fusion yields tetraploid (4m) cells in which 
parasexual chromosome reduction without recognized meiosis yields diploid progeny. b, Same-sex 
mating again involves loss of heterozygosity and the white-opaque switch. As identified by Alby et al.’, 
the same-sex path can then be advanced in two ways, both of which involve production of pheromone 
of the opposite mating type that promotes fusion and results in tetraploid a cells. If a cells lack Bar1 
protease, they do not destroy the a-pheromone they produce; and in ménage-d-trois matings, even a 
few a cells can provide a-pheromone. 
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50 YEARS AGO 

Most areas where there is a 

high prevalence of multiple 
sclerosis coincide ina highly 
suggestive fashion with areas 
where glaciation has played an 
important part in providing parent 
material for soils. However, the 
converse is certainly not true: all 
glacial soils cannot be correlated 
with areas where the prevalence 
of multiple sclerosis is high. 
Maps showing the distribution 

of multiple sclerotic cases in 
Northern Ireland, south-eastern 
Ontario, Sweden, and Denmark 
are alike in one respect—they all 
bear a remarkable resemblance to 
maps illustrating the distribution 
of boulders or geochemical 
anomalies in any map prepared 
for the purpose of searching 

in a glaciated area for buried 

ore bodies ... One other point 
appears worthy of note: higher 
than ‘normal’ quantities of lead 
are known to occur in those rocks 
... occurring in areas where the 
prevalence of multiple sclerosis is 
high... It should also be noted that 
anomalous amounts of lead may, 
on occasion, be accompanied 

by anomalous amounts of some 
other elements such as silver, 
barium, magnesium, and fluorine. 
From Nature 15 August 1959. 


100 YEARS AGO 

It is a usual custom in 
pharmacological work to state 
the dosage of drugs as so much 
per kilogram of body-weight 

of animal or man, the subject 

of experiment or treatment. 

Prof. Benjamin Moore points 

out in the Biochemical Journal 

... that this method of stating 
dosage is inaccurate, the dose 

of a drug for two individuals of 
different size, apart from peculiar 
idiosyncrasies, being proportional, 
not to their weights, but to their 
body surfaces, in other words, to 
the two-thirds powers of their 
weights. Thus an adult of 150 Ib. 
weight cannot be given fifteen 
times the dose for an infant of 

10 b., but much more nearly a dose 
only six times as much. It may be 
that it is this principle which limits 
the value of some drugs. 

From Nature 12 August 1909. 
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CHEMICAL BIOLOGY 


Protein picker 


Reporting in Nature Chemical Biology, 
Ngo et al. describe a system for 
cell-selective protein labelling in 
mixtures of cells (J. T. Ngo et al. 
Nature Chem. Biol. doi:10.1038/ 
nchembio.200; 2009). Their 

work is based on the principle 

that proteins can be tracked in 
experiments if they are engineered 
so that some of their constituent 
amino acids contain a tag, such 

as aradioactive label; tagged 

amino acids can be added to cells 

in culture, whereupon the cells 
incorporate them into newly formed 
proteins. The general problem with 
this approach is that all the cells 

in the culture become labelled, 
whereas it might be that only certain 
types of cell need to be tagged. 

Ngo and colleagues’ solution to 
that problem involves non-naturally 
occurring amino acids that contain 
azide (N3) groups in their side 
chains. Azides don't react with 
biological molecules, but under 
certain conditions they do react 
quickly with synthetic molecules 


that contain alkyne groups (which 
have carbon-carbon triple bonds). 
Once incorporated into proteins, 
azide-containing amino-acid 
residues will thus react with 
alkyne-containing fluorescent dyes 
or affinity reagents, so tagging the 
proteins for imaging, detection 
or separation. 
Non-natural amino acids are 
already used for protein labelling, 
but Ngo et al. take the idea further. 
They exploit an azide-containing 
amino acid that is incorporated 
into proteins only by cells that 
express a mutant of the methionyl- 
tRNA synthetase enzyme (which 
is involved in protein synthesis). 
When the authors added this 
amino acid to a co-culture of 
normal and mutant Escherichia 
coli cells, only the mutants were 
subsequently tagged with an 
alkyne-containing affinity reagent 
or fluorescent dye. 
To show that their technique could 
be applied to mixtures of bacterial 
and mammalian cells, Ngo et al. 


infected mouse macrophages with 
E. coli cells that express the mutant 
enzyme, in the presence of the 
azide-containing amino acid. They 
then treated the macrophages with 
a green alkyne-containing dye, 
which tagged only the bacterial 
cells (pictured; macrophages are 
stained orange, and are about 

15 micrometres in diameter). 

The authors were also able to 
specifically tag newly synthesized 
bacterial proteins with an affinity 
reagent, which was then used as 


a handle to isolate those proteins 
from the culture. 

The authors’ approach allows 
the cellular origins of proteins in 
complex multicellular systems to 
be determined. It could therefore 
be handy for isolating proteins from 
pathogens in studies of infections, 
for example, or for identifying 
the complement of proteins of a 
single bacterial species living ina 
community of many other microbial 
organisms. 
Andrew Mitchinson 


proliferation. The missense CBL mutations 
found in MPNs introduce amino-acid substi- 
tutions that disable ubiquitin-ligase activity. 
Consistent with a previous report”, Sanada 
et al.* show that mice lacking the Cbl gene pro- 
duce increased numbers of immature blood 
cells. They also find that Cbl inactivation 
promotes the development of leukaemia in 
mice engineered to express the pro-leukaemic 
BCR-ABL gene. Mutant Cbl proteins inhibit 
ubiquitination of growth-factor receptors 
in blood-cell lines, even in cells that retain a 
normal copy of the CBL gene””, and the authors 
demonstrate that this inhibition is associated 
with prolonged receptor activation and an 
enhanced proliferative response to cytokine 
growth factors. Although these studies pro- 
vide evidence that CBL is a tumour-suppressor 
gene, Sanada and colleagues’ data’ also suggest 
amore complex role for CBL in leukaemogen- 
esis. For example, if mutation of a single CBL 
allele is sufficient to disrupt normal ubiquitin- 
ligase activity, it is unclear why MPN cells also 
inactivate the normal copy of the gene through 
aUPD. The authors’ findings that the effects of 
the mutant Cbl proteins are more pronounced 
in cells that lack a normal CBL gene suggest 
that other biochemical properties contribute 
to tumour growth. This idea is consistent with 
the observation that mice lacking Cbl do not 
spontaneously develop MPNs, as would be 
expected if the gene acted purely to suppress 


tumour formation. Finally, when the authors 
over-expressed mutant CBL in fibroblasts, these 
cells showed cancerous properties’. 

So what are we to make of CBL? Is it a tumour 
suppressor or an oncogene, or, as Shakespeare 
might have put it, “more than kin and less 
than kind”? On the one hand, there is strong 
selective pressure to delete the normal CBL 
allele in tumour cells, resulting in loss of ligase 
activity that restrains the output of activated 
growth-factor receptors. These are impecca- 
ble credentials for a tumour-suppressor pro- 
tein. However, the blood cells of patients with 
MPNs invariably retain at least one gene that 
encodes a functional, albeit mutant, Cbl pro- 
tein. Furthermore, mutated Cbl proteins seem 
to acquire unexpected growth-promoting 
functions” — this gain-of-function charac- 
teristic is not seen in blood stem cells that 
lack CBL. Such features implicate CBL as a 
bona fide oncogene. 

Tumour suppressor or oncogene? Perhaps 
a solution to this conundrum is that this 
multi-domain protein fine-tunes the growth 
of blood stem cells and progenitor cells by 
simultaneously promoting and restraining 
growth through distinct protein-protein 
interactions. Recent studies offer intriguing 
clues about the potential growth-promoting 
biochemical properties of mutant Cbl proteins. 
First, Sanada et al.’ find that expression of the 
mutant Cbl proteins is associated with aberrant 
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phosphorylation of STAT5, an activator of 
gene transcription. Aberrant phosphoryla- 
tion is a biochemical feature of some types of 
MPN", and although in this report’ it may be 
due to loss of Cbl-mediated ubiquitination of 
cytokine receptors that activate STATS, other 
mechanisms are possible. Second, CBL muta- 
tions and mutations in the oncogene NRAS 
were mutually exclusive in the adult MPN 
patients studied by the authors. CBL muta- 
tions were also found only in specimens from 
children with juvenile myelomonocytic leu- 
kaemia (a type of MPN) without mutations 
in NRAS or KRAS*. As this childhood leukae- 
mia is an MPN in which a hyperactive form of 
the cell-signalling molecule Ras has a central 
role, there is likely to be a connection between 
mutant Cbl proteins and Ras signalling. Data 
suggesting that Cbl regulates Ras trafficking in 
cells are intriguing in this respect’”. The elegant 
functional studies of Sanada et al.’ thus raise 
fascinating questions about the nature of onco- 
genic Cbl-mediated interactions in MPNs and 
how such interactions might be targeted to treat 
these disorders. | 
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MICROBIAL GENETICS 


Love the one you're with 


Joseph Heitman 


Candida albicans is notorious as an opportunistic microbe that causes 
thrush and serious systemic disease. For geneticists, however, it offers 
continuing revelations into the wondrously varied sex lives of fungi. 


Sex facilitates evolution by recombining 
parental genomes and purging deleterious 
mutations. There are myriad forms, but they 
have three steps in common: meiosis, which 
halves the number of chromosomes per cell; 
gamete production; and cell-cell fusion. Sex in 
most animals involves two individuals of oppo- 
site sex, who produce distinct gametes (sperm 
and oocytes). In fungi, sex typically involves 
morphologically similar cells of opposite mat- 
ing type (a and a) that reciprocally produce 
sex-stimulating pheromones. In other cases, 
fungi are self-fertile, and lone individuals can 
reproduce sexually. One well-known exam- 
ple of this is the model yeast Saccharomyces 
cerevisiae, in which a single cell divides, one of 


a Opposite-sex mating in Candida albicans 


the resulting cells switches mating type, and 
mating yields a/a diploid cells, with a double 
complement of chromosomes, that no longer 
mate but undergo meiosis. On page 890 of this 
issue, Alby et al.' report a conceptually similar 
but mechanistically distinct self-mating phe- 
nomenon in another yeast, the pathogenic 
Candida albicans. 

Candida albicans resides in the human micro- 
biota, for example in the gastrointestinal tract, 
and commonly causes infections. For more than 
a century it was thought to be strictly asexual. 
Then the mating-type gene locus was discov- 
ered, and most isolates were found to be a/a dip- 
loids that, like S. cerevisiae, are sterile’. Mutants 
with only one mating type (a/a or a/a) were 
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Figure 1| Opposite-sex and same-sex mating in Candida albicans. This fungus is diploid (2n): most 
strains are heterozygous, having both mating-type gene forms (a/a), and are sterile. a, Opposite-sex 
mating involves loss of heterozygosity, yielding a and a diploid cells that then switch from the white 
type to the opaque type specialized for mating. Cell-cell fusion yields tetraploid (4m) cells in which 
parasexual chromosome reduction without recognized meiosis yields diploid progeny. b, Same-sex 
mating again involves loss of heterozygosity and the white-opaque switch. As identified by Alby et al.’, 
the same-sex path can then be advanced in two ways, both of which involve production of pheromone 
of the opposite mating type that promotes fusion and results in tetraploid a cells. If a cells lack Bar1 
protease, they do not destroy the a-pheromone they produce; and in ménage-d-trois matings, even a 
few a cells can provide a-pheromone. 
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50 YEARS AGO 

Most areas where there is a 

high prevalence of multiple 
sclerosis coincide ina highly 
suggestive fashion with areas 
where glaciation has played an 
important part in providing parent 
material for soils. However, the 
converse is certainly not true: all 
glacial soils cannot be correlated 
with areas where the prevalence 
of multiple sclerosis is high. 
Maps showing the distribution 

of multiple sclerotic cases in 
Northern Ireland, south-eastern 
Ontario, Sweden, and Denmark 
are alike in one respect—they all 
bear a remarkable resemblance to 
maps illustrating the distribution 
of boulders or geochemical 
anomalies in any map prepared 
for the purpose of searching 

in a glaciated area for buried 

ore bodies ... One other point 
appears worthy of note: higher 
than ‘normal’ quantities of lead 
are known to occur in those rocks 
... occurring in areas where the 
prevalence of multiple sclerosis is 
high... It should also be noted that 
anomalous amounts of lead may, 
on occasion, be accompanied 

by anomalous amounts of some 
other elements such as silver, 
barium, magnesium, and fluorine. 
From Nature 15 August 1959. 


100 YEARS AGO 

It is a usual custom in 
pharmacological work to state 
the dosage of drugs as so much 
per kilogram of body-weight 

of animal or man, the subject 

of experiment or treatment. 

Prof. Benjamin Moore points 

out in the Biochemical Journal 

... that this method of stating 
dosage is inaccurate, the dose 

of a drug for two individuals of 
different size, apart from peculiar 
idiosyncrasies, being proportional, 
not to their weights, but to their 
body surfaces, in other words, to 
the two-thirds powers of their 
weights. Thus an adult of 150 Ib. 
weight cannot be given fifteen 
times the dose for an infant of 

10 b., but much more nearly a dose 
only six times as much. It may be 
that it is this principle which limits 
the value of some drugs. 

From Nature 12 August 1909. 
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Figure 2 | Same-sex mating in two pathogenic fungi. Unisexual reproduction occurs in both Candida 
albicans (a) and Cryptococcus neoformans (b). These forms of self-fertility may result in population 
features (such as inbreeding and clonality) that are associated with pathogenesis. Given that two such 
evolutionarily divergent pathogenic fungi both undergo same-sex mating, this process is likely to be 
even more general among fungi. The two species are distinguished in that mating in C. albicans is 
parasexual and no spores are produced, whereas in C. neoformans meiosis results in the production of 


infectious spores. 


found to mate in the laboratory or during infec- 
tion’. But the low frequency of mating raised the 
question of whether sex occurs in nature. This 
was resolved by various discoveries” *, includ- 
ing the observation that mating is enhanced a 
million-fold when cells switch from ‘white’ to an 
‘opaque type specialized for mating. 

In opposite-sex mating in C. albicans 
(Fig. 1a), cell-cell fusion produces tetraploid 
cells that have four copies of each chromosome 
(a/a/a/a). These tetraploids undergo a ‘para- 
sexual’ phase’, which restores the diploid state 
and allows limited genetic recombination but 
does not involve overt meiosis and does not 
produce spores. (The parasexual recombina- 
tion that does occur requires Spol11 (ref. 6), 
an enzyme that acts as a recombinase dur- 
ing meiosis in other organisms. Thus, sex in 
C. albicans could involve cryptic meiosis or 
novel parasexual roles for Spo11.) 

Alby et al.' report the discovery of a facet of 
C. albicans reproduction in which cells mate 
with each other via same-sex mating (Fig. 1b), 
rather than with the opposite mating type. This 
observation arose from analysing C. albicans 
mutants lacking barrier protease, an enzyme 
that inactivates a-pheromone. Barrier was 
discovered in S. cerevisiae as a factor that 
interferes with mate signalling when a cells 
producing barrier are interposed between a 
and a cells’. Among other things, we now know 
that barrier is encoded by the BARI gene ina 
mating-type-specific manner, and that it acts 
on a-pheromone, enabling a cells to choose 
mates producing the most pheromone. 

Curiously, C. albicans a cells express genes 
encoding both a- and a-pheromone. Alby 
et al.’ observed that a cell bar! mutants lack- 
ing barrier produce unusual wrinkled colo- 
nies, and that these colonies were responding 
to pheromone. Surprisingly, they also dis- 
covered that a-a cell fusion can occur both 
between bar1 mutants and between wild-type 
and bar1-mutant cells. Crosses with mutant 
strains showed that same-sex mating occurs 
when a-pheromone is produced by one a cell 
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and is sensed by a-pheromone receptors on 
both the producing and the partner a cell. 

Alby etal. found that same-sex mating is less 
efficient than opposite-sex mating, but is read- 
ily detectable, and in some cases is appreciable. 
They also observed same-sex mating between 
wild-type a cells (producing Bar1) in ménage- 
a-trois matings in which an a-cell minority 
provided a-pheromone. Under these condi- 
tions, sufficient a-pheromone may overwhelm 
Barl and trigger a-a cell fusion. Same-sex 
mating yielded tetraploid cells (a/a/a/a) that 
could undergo parasexual chromosome loss 
to diploid progeny. Thus, C. albicans does not 
need both mating-type locus versions for the 
parasexual cycle, in contrast to their require- 
ment for meiosis in S. cerevisiae. 

Do opposite- and same-sex mating occur 
in nature? The original discovery of ineffi- 
cient C. albicans a—a mating fuelled specula- 
tion that it could be a lab artefact: however, a 
complex cellular network has been maintained 
that supports efficient a~a mating. Population- 
genetics studies and identification of mating- 
stimulatory host conditions lend further 
support to the view that opposite-sex mating 
happens in nature” *. That same-sex mating 
occurs most efficiently with mutant and not 
wild-type strains suggested that bar] mutants 
might occur in the population. No natural bar1 
mutants have been identified so far, however, 
and they might arise only transiently. Alter- 
natively, host conditions or protease inhibi- 
tors produced by C. albicans (a-pheromone 
variants) might stimulate same-sex mating 
in nature. Even infrequent same-sex mat- 
ing in ménage-a-trois matings might suffice 
for genetic exchange in skewed mating-type 
populations, such as commensal biofilms on 
host mucosal membranes. 

Although it might seem odd that yeast 
would mate by both opposite-sex/outcross- 
ing and same-sex/inbreeding, S. cerevisiae 
is again instructive. The natural population 
consists of isolates that switch mating type 
and self-mate, and others that cannot switch 
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but are fertile with the opposite mating type. 
Thus, a well-defined species can evidently 
consist of individuals with different modes of 
sexual reproduction. It is striking, however, 
that two closely related yeasts (C. albicans and 
S. cerevisiae) evolved such distinct modes of 
self-fertility. The more evolutionarily divergent 
fission yeast, Schizosaccharomyces pombe, also 
switches mating type and self-mates, like S. cer- 
evisiae, but independently evolved a machin- 
ery to do so. This diversity in forms of sexual 
reproduction illustrates the remarkable plas- 
ticity of fungi, which have evolved to exploit 
different environments. 

A divergent pathogenic yeast, Cryptococcus 
neoformans, also reproduces unisexually*. This 
species has a well-defined a—a sexual cycle, and 
the fact that the natural population is largely 
a unisexual was puzzling. The discovery that 
a cells undergo same-sex mating, yielding dip- 
loid cells that complete meiosis and produce 
spores (Fig. 2), suggested another route by 
which diversity might be maintained*. Popu- 
lation-genetics studies provide evidence that 
same-sex mating occurs in nature, and impli- 
cate the process in an outbreak of Cryptococcus 
infection”””. That same-sex mating arose con- 
vergently in two divergent pathogenic yeasts 
suggests that unisexual reproduction may be 
linked to microbial pathogenesis, and could 
be even more widespread than is currently 
appreciated. 

Transitions in sexual reproduction between 
inbreeding and outbreeding also occur in 
plants and animals. In plants, mutations in 
self-incompatibility loci involved in pollen 
recognition evoke transitions from cross- 
pollination to self-pollination”’. One individual 
can thereby colonize different environmen- 
tal niches, and become more cosmopolitan. 
Parthenogenesis — virgin birth — is restricted 
in mammals by genomic imprinting, but 
remarkable examples have been reported in 
Komodo dragons and sharks!*">, In these cases, 
otherwise sexual organisms reproduce under 
extreme circumstances, possibly enabling the 
colonization of islands or oceanic expanses by 
single individuals. a 
Joseph Heitman is in the Department of 
Molecular Genetics and Microbiology, 

Duke University Medical Center, Durham, 
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MICROBIAL GENETICS 


Love the one you're with 


Joseph Heitman 


Candida albicans is notorious as an opportunistic microbe that causes 
thrush and serious systemic disease. For geneticists, however, it offers 
continuing revelations into the wondrously varied sex lives of fungi. 


Sex facilitates evolution by recombining 
parental genomes and purging deleterious 
mutations. There are myriad forms, but they 
have three steps in common: meiosis, which 
halves the number of chromosomes per cell; 
gamete production; and cell-cell fusion. Sex in 
most animals involves two individuals of oppo- 
site sex, who produce distinct gametes (sperm 
and oocytes). In fungi, sex typically involves 
morphologically similar cells of opposite mat- 
ing type (a and a) that reciprocally produce 
sex-stimulating pheromones. In other cases, 
fungi are self-fertile, and lone individuals can 
reproduce sexually. One well-known exam- 
ple of this is the model yeast Saccharomyces 
cerevisiae, in which a single cell divides, one of 
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the resulting cells switches mating type, and 
mating yields a/a diploid cells, with a double 
complement of chromosomes, that no longer 
mate but undergo meiosis. On page 890 of this 
issue, Alby et al.' report a conceptually similar 
but mechanistically distinct self-mating phe- 
nomenon in another yeast, the pathogenic 
Candida albicans. 

Candida albicans resides in the human micro- 
biota, for example in the gastrointestinal tract, 
and commonly causes infections. For more than 
a century it was thought to be strictly asexual. 
Then the mating-type gene locus was discov- 
ered, and most isolates were found to be a/a dip- 
loids that, like S. cerevisiae, are sterile’. Mutants 
with only one mating type (a/a or a/a) were 
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Figure 1| Opposite-sex and same-sex mating in Candida albicans. This fungus is diploid (2n): most 
strains are heterozygous, having both mating-type gene forms (a/a), and are sterile. a, Opposite-sex 
mating involves loss of heterozygosity, yielding a and a diploid cells that then switch from the white 
type to the opaque type specialized for mating. Cell-cell fusion yields tetraploid (4m) cells in which 
parasexual chromosome reduction without recognized meiosis yields diploid progeny. b, Same-sex 
mating again involves loss of heterozygosity and the white-opaque switch. As identified by Alby et al.’, 
the same-sex path can then be advanced in two ways, both of which involve production of pheromone 
of the opposite mating type that promotes fusion and results in tetraploid a cells. If a cells lack Bar1 
protease, they do not destroy the a-pheromone they produce; and in ménage-d-trois matings, even a 
few a cells can provide a-pheromone. 


© 2009 Macmillan Publishers Limited. All rights reserved 


50 YEARS AGO 

Most areas where there is a 

high prevalence of multiple 
sclerosis coincide ina highly 
suggestive fashion with areas 
where glaciation has played an 
important part in providing parent 
material for soils. However, the 
converse is certainly not true: all 
glacial soils cannot be correlated 
with areas where the prevalence 
of multiple sclerosis is high. 
Maps showing the distribution 

of multiple sclerotic cases in 
Northern Ireland, south-eastern 
Ontario, Sweden, and Denmark 
are alike in one respect—they all 
bear a remarkable resemblance to 
maps illustrating the distribution 
of boulders or geochemical 
anomalies in any map prepared 
for the purpose of searching 

in a glaciated area for buried 

ore bodies ... One other point 
appears worthy of note: higher 
than ‘normal’ quantities of lead 
are known to occur in those rocks 
... occurring in areas where the 
prevalence of multiple sclerosis is 
high... It should also be noted that 
anomalous amounts of lead may, 
on occasion, be accompanied 

by anomalous amounts of some 
other elements such as silver, 
barium, magnesium, and fluorine. 
From Nature 15 August 1959. 


100 YEARS AGO 

It is a usual custom in 
pharmacological work to state 
the dosage of drugs as so much 
per kilogram of body-weight 

of animal or man, the subject 

of experiment or treatment. 

Prof. Benjamin Moore points 

out in the Biochemical Journal 

... that this method of stating 
dosage is inaccurate, the dose 

of a drug for two individuals of 
different size, apart from peculiar 
idiosyncrasies, being proportional, 
not to their weights, but to their 
body surfaces, in other words, to 
the two-thirds powers of their 
weights. Thus an adult of 150 Ib. 
weight cannot be given fifteen 
times the dose for an infant of 

10 b., but much more nearly a dose 
only six times as much. It may be 
that it is this principle which limits 
the value of some drugs. 

From Nature 12 August 1909. 
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Hydrogen-fuelled vehicles 


Louis Schlapbach 


Hydrogen is hailed as a non-polluting synthetic fuel that could replace oil, especially for transport 
applications. The technology to make this a reality — particularly hydrogen-storage materials — has 
been along time coming, but the first commercial vehicles might now be only a few years away. 


Why would hydrogen bea good 
transport fuel? 

Hydrogen is the lightest element in the periodic 
table, and so molecular hydrogen has the 
highest energy-to-mass ratio of any chemi- 
cal. Indeed, that’s why it is used to propel 
rockets, where it is essential to minimize the 
mass of fuel carried. Hydrogen is carbon-free, 
non-toxic, and its thermal or electrochemical 
combustion with oxygen yields nothing but 
energy and water — although its combustion 
in air might generate nitric oxide air pollutants 
in controllable amounts. Another advantage 
is that the main source of hydrogen is water, 
which is essentially an unlimited resource. 


And what are the drawbacks? 

The chief problem is that hydrogen is a gas at 
room temperature, so it takes up an impractically 
large amount of space. The gas must therefore 
be compressed in some way to make it com- 
pact enough for mobile applications. Another 
issue is less obvious: hydrogen is an energy 
carrier, not a source of energy. Unlike genu- 
ine sources of energy (such as solar radiation, 


fossil hydrocarbons, hydropower and nuclear 
power), hydrogen is not naturally available in 
sufficient quantities for widespread use, and 
must therefore be made by splitting water into 
hydrogen and oxygen. Splitting requires energy, 
whether in the form of electricity, light, heat or 
chemicals. But most of that energy is recovered 
when hydrogen is combusted. 


Isn't hydrogen dangerously explosive? 

In practice, no more so than petrol. Peo- 
ple often think of the fire that destroyed the 
hydrogen-filled Zeppelin Hindenburg in 1937, 
and assume that the gas is dangerously com- 
bustible. In fact, the problem with the Hinden- 
burg was its highly inflammable skin. It is true 
that mixtures of hydrogen and air burn over a 
rather broad composition range (mixtures in 
which 4—75% of the volume is hydrogen will 
burn; for comparison, mixtures of only 1-8% 
by volume of petrol in air will burn). Further- 
more, hydrogen is more volatile than petrol, 
and diffuses faster through air. But compara- 
tive safety studies of fuel leaks from petrol- 
and hydrogen-fuelled cars, performed at the 
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University of Miami, clearly demonstrate that 
hydrogen-fuelled cars pose less of a fire risk 
than petrol-fuelled vehicles. 


What must be done for hydrogen to 
become a widely used fuel? 

There are three crucial steps. First, economi- 
cally viable methods must be developed to 
produce large quantities of hydrogen, ideally 
using renewable energy sources. Advanced 
water-splitting techniques that use electric- 
ity and some heat as energy sources are very 
energy-efficient, and might one day fit the bill. 
Second, hydrogen distribution and storage sys- 
tems are required. Existing gas-distribution 
networks in cities and between industrial 
sites could be used to move large quantities 
of hydrogen around, but mobile hydrogen- 
storage vessels and filling stations are clearly 
necessary for transport applications. Several 
prototype hydrogen-filling stations already 
service fleets of hydrogen-fuelled buses, for 
example at Nagoya airport in Japan and in Ber- 
lin. But no countrywide hydrogen infrastruc- 
ture has yet been set up — although Germany 
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has developed a tool (called H2invest) 
for planning such an infrastructure. 
The third step is to develop technolo- 
gies and devices for converting the 
chemical energy stored in hydrogen into 
more useful forms of energy. Currently, 
hydrogen can yield thermal energy in a 
combustion engine or electrical energy ina 
fuel cell (Fig. 1). 


Could petrol-fuelled car engines be 
converted to use hydrogen? 

Yes, by making minor modifications. But 
that’s not a sustainable solution in the 
long term, as combustion engines convert 
chemical energy to mechanical energy 
(via thermal energy) with low efficiency. 
For example, the energy-conversion effi- 
ciency of a typical petrol-fuelled car in every- 
day traffic conditions is little more than 10%. 
Far more efficient would be a combined hydro- 
gen-fuel-cell-electric-motor (Hy-FC-EM) 
system, known as a power train. But the devel- 
opment of an economically viable power 
train has proven far more difficult than was 
expected. 


What are the problems in developing 
power trains? 

There are three main issues. The first is cost 
— fuel cells generally require catalysts that 
contain rare and expensive elements. The 
second issue is that fuel cells have short life- 
times. Nevertheless, a few companies have 
successfully overcome these problems to 
develop potentially useful hydrogen fuel cells 
for transport applications. The remaining 
big hurdle is therefore to find compact, light- 
weight hydrogen-storage systems capable 
of delivering hydrogen gas to a fuel cell at 
approximately room temperature, and at 
pressures not much greater than atmospheric 
pressure. To stimulate research and develop- 
ment in this area, in 2002 the US Depart- 
ment of Energy set quantitative goals for the 
hydrogen content of storage devices (6% by 
mass by 2010, then 9% by 2015). So far, the 
targets haven't been met, and the new US 
government decided to cut its funding of 
hydrogen-storage research, opting instead to 
put more money into developing batteries for 
fully electric vehicles. However, the US House 
of Representatives has subsequently voted to 
restore most of this funding, and the energy 
department has revised the targets to 5.5% by 
mass for 2015. 


Do the targets make sense? 

No — the whole car needs to be optimized for 
mass, not just the fuel tank. In fact, many car 
manufacturers, owners and public agencies 
do not care if 1 tonne or more of mass is used 
inefficiently in heavy vehicles (which might 
weigh upwards of 2.5 tonnes). Concentrating 
efforts on the mass of the fuel tank alone is like 
asking an obese patient to lose weight from his 
left arm to control his total weight. 
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Figure 1| A hydrogen fuel cell. Hydrogen is 
electrochemically combusted in fuel cells to 
produce energy. a, Hydrogen is channelled to 

the anode at one side of the cell, where the H, 
molecules split into hydrogen atoms. b, A catalyst 
lining the anode causes the hydrogen atoms 

to split into hydrogen ions (H") and electrons. 

A polymer electrolyte membrane allows the 
hydrogen ions to pass between the anode and the 
cathode, but the electrons must pass through an 
external circuit to reach the cathode, creating a 
current. c, Oxygen is channelled to the cathode, 
where it reacts with electrons and hydrogen ions 
to form water as the only side product. The water 
flows out of the cell. 


How much hydrogen does a car 

need to store? 

Using today’s technologies, a safe, weight- 
efficient, five-seater car might weigh 1.2-1.5 
tonnes. This would travel at least 500 kilo- 
metres on 30-35 litres of petrol or diesel, with 
the combined mass of the fuel and tank being 
80 kilograms. To travel the same distance 
using a Hy-FC-EM power train, that car would 
need about 5 kilograms of hydrogen. The 
same car fitted with a hydrogen combustion 
engine would need 10 kilograms of hydrogen 
or more. The problem is that 5 kilograms of 
hydrogen at room temperature and pressure 
occupy about 56,000 litres — equivalent to a 
balloon 4.8 metres in diameter. Some method 
of squeezing this volume down by 1,000-fold 
is therefore essential. 


Can't hydrogen be stored at high 
pressure, or liquefied, to reduce 

the volume? 

Yes, but this isn’t practical for hydrogen storage 
in cars. If hydrogen behaved like an ‘ideal’ gas, 
then a pressure of 100 megapascals (100 MPa, 
1,000 times that of atmospheric pressure) 
would be needed to compress it to 1,000th of its 
volume at atmospheric pressure. But hydrogen 
exhibits ideal-gas behaviour only up to about 
10 MPa; at pressures above 50 MPa, it devi- 
ates markedly from such behaviour, requiring 
even more pressure to compress it further. 
Also, the safety issues associated with stor- 
ing the gas at those high pressures become a 
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great concern. Compression of hydrogen 
above 100 MPa is therefore not a sensi- 
ble option. Liquefaction certainly yields 
hydrogen in an extremely energy-dense 
form (as used for space technology), but 
the process requires temperatures below 
-250 °C. Such conditions, and the cryo- 
genic techniques needed to obtain them, 
are demanding — but not infeasible — 
for ordinary petrol stations. The lique- 
faction process, however, also results in 
the loss of about 30% of the chemical 
energy of gaseous hydrogen. 


So what else can be done? 
Hydrogen can be reversibly absorbed or 
adsorbed (accumulated at the surface) 
by certain solid materials, greatly reduc- 
ing its volume. The gas is then released on 
heating or on lowering the hydrogen-gas 
pressure. Two main classes of hydrogen- 
storage material exist, corresponding to two 
mechanisms of sorption: physisorption, in 
which H, molecules are weakly adsorbed 
by the material, but don’t react chemically 
with it; and chemisorption, in which H, 
molecules react with the material’s surface, 
dissociating into hydrogen atoms that are 
absorbed into the bulk, where they form 
metallic, covalent or ionic bonds with the 
material (Fig. 2). 


What sorts of material adsorb 
hydrogen by physisorption? 

Those that have high surface areas per unit 
of mass, such as some graphitic materials or 
high-porosity compounds. At full capacity, 
their hydrogen content can reach about 8% by 
mass. The problem is that, on planar surfaces 
(which occur in many materials) physisorption 
is weak. Temperatures below -200 °C (about 
the temperature of liquid nitrogen) are there- 
fore generally needed to adsorb useful quanti- 
ties of hydrogen. Physisorption is stronger in 
materials that have narrow, curved pores, and 
at surfaces decorated with hydrogen-attracting 
atoms. Zeolites (aluminosilicates) and metal- 
organic frameworks (complexes of metal ions 
and rigid organic molecules) might provide 
these features, raising the hope that hydrogen 
physisorption could one day be achieved at 
room temperature in such materials. 


And which materials use 
chemisorption? 

Hydrogen reacts with many materials — often 
metals or alloys, but also other compounds — 
to form hydrides that can be used as hydrogen- 
storage media. The reactions are often fast and 
reversible at practically accessible temperatures 
(20-100 °C), so that large amounts of hydrogen 
can be absorbed and then recovered. Materials 
that have been extensively studied for hydro- 
gen storage include: hydrides of elemental 
metals (magnesium, palladium, thorium); 
hydrides of compounds that have the for- 
mula AB; (where A and B are usually metals), 
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such as lanthanum-nickel hydride (LaNi;H,); 
and ‘complex hydrides, such as magnesium- 
nickel hydride (Mg,NiH,) or barium-rhodium 
hydride (BaRhH,). 


How well do hydrides perform? 

They easily reach hydrogen-volume densi- 
ties far above that of liquid hydrogen, and 
can be recharged several tens of thousands 
of times. But because the host materials are 
not light, the quantity of hydrogen stored has 
not exceeded 4% by mass. To date, most of 
these compounds aren't practically useful for 
mobile hydrogen storage, as their hydrogen- 
storage capacities per unit mass are either too 
small or the temperatures required to release 
stored hydrogen are too high. But their volu- 
metric hydrogen-storage capacity and their 
safety features are impressive. 


Does that mean hydride research 
hasn't been fruitful? 

On the contrary, there have been many useful 
discoveries from this field. Metal hydrides are 
now used as electrodes for batteries, such as 
those in commercially available hybrid cars. 
These batteries are safe, compact and quickly 
rechargeable. Metal hydrides also form the 
basis of certain temperature sensors in aircraft, 
and might one day be used in intelligent coat- 
ings that control the transparency of windows. 
Furthermore, hydrogen-induced pulveriza- 
tion techniques, discovered during the course 
of metal-hydride research, are now used to 
manufacture permanent magnets. 


What's the latest thing in 
hydrogen-storage materials? 

Current studies focus on solid-state storage 
using complex hydrides of light elements 
(those with low atomic masses), or high- 
pressure tanks (35 MPa) filled with such 
materials. These complex hydrides include 
alanates (which contain AIH,” anions, where 
Al is aluminium); borohydrides (which 
contain BH, anions, where B is boron); amide 
hydrides (which contain lithium, nitrogen and 
hydrogen); and combinations thereof. They 
reach attractively high hydrogen contents of 
8-20% by mass. For these materials, hydride 
formation and decomposition proceed 
over several steps through intermediate 
compounds. Experimental and modelling 
efforts are aimed at fine-tuning these 
reactions, and finding catalysts that speed up 
hydrogen release. Such efforts should yield 
materials that charge and discharge hydrogen 
in a temperature range of 20-100 °C; currently, 
temperatures of 300 °C or more are required. 
But practical applications of these materials 
are probably still a long way off. Another 
approach to hydrogen storage has also recently 
been resurrected, having previously fallen out 
of favour because of poor performance. In this 
approach, liquid hydrocarbons are used as a 
source of hydrogen gas. The process generates 
a dehydrogenated hydrocarbon side product, 
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Figure 2 | Mechanisms of hydrogen storage. a, Materials that have high porosity store hydrogen 
using physisorption, in which the gas molecules accumulate at the surface of the material, but don’t 
react chemically with it. b-d, Certain metals, alloys and other compounds store hydrogen using 
chemisorption. In this process, hydrogen molecules react with the surface of the material (b), where 
they split into separate atoms. Initially, the hydrogen atoms adopt random locations in the host 
material (c). They eventually form metallic, covalent or ionic bonds with the metal atoms to yield a 
hydride compound, in which the hydrogen atoms adopt a regular arrangement (d). 


which could be re-hydrogenated at a recycling 
centre to generate more fuel. 


When will we see hydrogen-fuelled 
cars on the market? 

Hydrogen-powered buses in Germany and 
Japan have already been successful, and about 
50 prototype cars — both thermal-combustion 
and fuel-cell types — have so far been tested, 
with good results. I therefore wouldn't be sur- 
prised to see Hy-FC-EM cars for sale in about 
five years. 


What are the other options for 
powering vehicles? 

Given their current dominance, oil-based 
fuels will undoubtedly be used in vehicles 
for some time to come, although uncertain- 
ties about supply mean that their days are 
probably numbered. New types of recharge- 
able battery — safer lithium and lithium—air 
batteries, and higher-capacity metal-hydride 
batteries — will also raise the market poten- 
tial of fully electric city vehicles and hybrid 
cars. It should be noted, however, that if most 
car owners switched to fully electric cars, 
many countries would need to double their 
electricity production. Similarly, hydrogen 
production also requires energy, such as 
electricity. 


How will hydrogen compete with 

those other power sources? 

This depends on trends in car design and the 
cost-effectiveness of hydrogen technology. 
Hydrogen-fuelled cars are already technically 
feasible, but if they are to be widely used, they 
must also be economically competitive with 
rival models that use other fuels. This will 
probably happen in a few years for smaller 
cars (those weighing about 1 tonne); a trend 
towards lightweight vehicles would therefore 
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help to kick-start the market. But although car 
manufacturers increasingly use lightweight, 
high-strength materials, the average total mass 
of a car has continuously increased over the 
past decade, as vehicles grow in size, power, 
comfort and safety. 


So what are the prospects for 

hydrogen as a fuel? 

A weak trend towards customers preferring 
smaller vehicles does exist, and this will be 
strengthened as car price and fuel economy 
become more important to buyers. New safety 
devices that not only give drivers warnings, but 
also act directly on the car, should decrease the 
probability of accidents, reducing the need 
for heavy protection in vehicles. That would 
open the door to lightweight vehicles and so to 
hydrogen-fuelled mobility. Hydrogen therefore 
certainly has a place in long-term transport 
strategies. But to ensure success, the challenges 
ahead require smart goals and long-term sup- 
port of the best scientists and engineers — not 
‘stop-and-go politics from successive govern- 
ments, as currently seems to be the case. 
Louis Schlapbach is at Empa - Materials 

Science and Technology, CH-8600 Dilbendorf, 
Switzerland, and the National Institute for 
Materials Science, Tsukuba, Japan. 

e-mail: louis.schlapbach@me.com 
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METALLOPROTEINS 


roteins can catalyse a remarkably wide 
range of chemical reactions. Yet the main 
differences among polypeptides are in the 
side chains of naturally occurring amino 
acids, which account for only a small proportion of 
the possible chemical functionality. The diversity 
of function is instead made possible partly because 
proteins can incorporate cofactors — such as small 
organic molecules, single metal atoms or clusters 
that contain metal and non-metal atoms — into their 
active sites. 

Almost half of all enzymes require the presence 
of a metal atom to function. These ‘metalloproteins’ 
have captivated chemists and biochemists, 
particularly since the 1950s, when the first 
X-ray crystal structure of a protein, sperm whale 
myoglobin, indicated the presence of an iron atom. 
Much is now understood about how metal clusters 
are assembled, how metal ions or clusters are 
introduced into target proteins, and which metal 
ions are commonly found in metalloenzymes. In 
addition, we are much closer to understanding the 
mechanisms by which metalloenzymes catalyse such 
a range of complex chemical reactions. But, despite 
more than half a century of research by chemists, 
biochemists and cell biologists, many discoveries 
remain to be made. 

The articles in this Insight highlight some of the 
most exciting current research on metalloproteins, 
including how enzymes containing complex metal 
clusters metabolize small gaseous molecules, how 
proteins containing iron-sulphur clusters are 
assembled, and how metalloenzymes containing a 
single metal ion catalyse the halogenation of small 
organic molecules. 

We are pleased to acknowledge the financial 
support of AstraZeneca in producing this Insight. 
As always, Nature carries sole responsibility for all 
editorial content and peer review. 


Joshua Finkelstein, Senior Editor 
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INSIGHT 


Structure-function relationships of 
anaerobic gas-processing metalloenzymes 


Juan C. Fontecilla-Camps', Patricia Amara’, Christine Cavazza’, Yvain Nicolet' & Anne Volbeda’ 


Reactions involving H,, N,, CO, CO, and CH, are likely to have been central to the origin of life. This is indicated 
by the active-site structures of the enzymes involved, which are often reminiscent of minerals. Through the 
combined efforts of protein crystallography, various types of spectroscopy, theoretical calculations and 
model chemistry, it has been possible to put forward plausible mechanisms for gas-based metabolism by 
extant microorganisms. Although the reactions are based on metal centres, the protein matrix regulates 
reactivity and substrate and product trafficking through internal pathways, specific ligation and dielectricity. 


The most popular autotrophic theory of the origin of life postulates that 
primordial metabolisms developed on mineral iron-sulphur surfaces 
under reducing conditions’. During this period of the Earth’s evolution, 
between 4.6 and 3.5 billion years (Gyr) ago, the atmosphere was probably 
rich in gases such as H,, CO and CO,, and its hot oceans contained rela- 
tively high concentrations of transition-metal ions such as Fe”* and Ni** 
(ref. 2). Similar conditions are found around contemporary hydrothermal 
vents generated by plate tectonics at the ocean floor. Thermophilic metha- 
nogenic Archaea can grow around vents called black smokers, owing to 
the oxidation of expelled H, and the reduction of CO, to CH, (ref. 3). 
This biotope shows that life can thrive independently of solar energy, 
which was a requirement for early organisms before the emergence of 
photosynthesis. The oxidation of H,S by microbial symbionts of vent- 
associated organisms such as tube worms is not discussed here because 
it requires O, and is flavin based rather than metal based. 

Along with H, as an energy vector, CO, was most probably used as 
a carbon source through an ancestral version of the Wood-Ljungdahl 
pathway, which generates acetyl-coenzyme A (CH,C(O)-SCoA) to be 
used in anabolic processes”. Conversely, nitrogenase, the enzyme capable 
of reducing N, to ammonia ina process crucial for life, is thought to have 
originally evolved as a detoxifier, neutralizing the poisonous cyanogen gas, 
(CN),, present in the early atmosphere’. All these reactions took place in 
an anoxic environment, as it was not until about 2.2 Gyr ago that oxygen 
produced by photosynthetic cyanobacteria entered the atmosphere in 
significant amounts’. As a consequence, extant anaerobic H, and/or CO 
oxidizers and CO, reducers are generally confined to remaining anoxic 
settings, for example the bottoms of lakes and oceans and the digestive sys- 
tems of animals. A notable feature of all these gas-based redox reactions is 
that they are catalysed by enzymes containing complex metallic active sites 
often reminiscent of FeS-containing mineral structures such as greigite, 
pyrrhotite, violarite and pyrite. This analogy constitutes the basis for the 
concept of primordial surface metabolism on transition-metal sulphides 
of the ‘iron-sulphur-world theory for the origin of life’. In all the gas- 
based reactions discussed in this Review (Table 1), catalysis seems to be 
metal centred, and with one exception the active site is buried. The role of 
the protein matrix is to determine the specificity of the reaction, provide 
ligands, modulate the redox potential and, in several cases, allow gas dif- 
fusion to and from the active site through hydrophobic tunnels. 

Given the complexity of the active sites, it is not surprising that their 
assembly and insertion into the corresponding proteins typically involves 


a significant number of gene products. The well-studied examples of 
nitrogenase and [NiFe]- and [FeFe] -hydrogenase®” are reviewed in some 
detail. Perhaps the most complicated aspect of gas-based catalysis is the 
elucidation of its mechanism. Often, the joint efforts of spectroscopy, 
X-ray crystallography, computational studies and model chemistry have 
been required to shed light on its nature, and in certain cases there is still 
room for debate. In this Review, we discuss the three known classes of 
hydrogenase, bifunctional CO dehydrogenase (CODH)-acetyl-SCoA 
synthase (ACS), methyl-coenzyme M (CH,-SCoM) reductase (MCR) 
and nitrogenase. Four of these enzymes’ active sites contain nickel, an 
otherwise uncommon element in biology, whereas cuboidal FeS clus- 
ters are almost ubiquitous (see page 831). The complex gas-catalysing 
active sites pose a formidable problem to synthetic chemists. Both bio- 
mimetic and bio-inspired models have been reported. A few examples 
are discussed. 


Active sites 

Figure 1 shows the active sites discussed in this Review. Hydrogen catal- 
ysis is carried out by at least three well-characterized types of enzyme: 
[Fe]-, [FeFe]- and [NiFe]-hydrogenases. Of these, [Fe]-hydrogenases 
are found only in methanogens’. By contrast, [FeFe]-hydrogenases are 
present in bacteria and some unicellular eukaryotes, whereas their [NiFe] 
counterparts are found in bacteria and Archaea’. Although these proteins 
are phylogenetically unrelated, the three active sites contain a Fe(CO),, 


Table 1| A compilation of the various enzyme reactions discussed 


No. Enzyme Reaction 

la Hydrogenases H,=H +H* 

1b H =H*+2e 

Ic H,= 2H*+2e" 

1d Methenyl-H,MPT* +H = methylene-H,MPT 

2 CO CO, +2e +2H*=CO+H,0 
dehydrogenase 

3. Acetyl-SCoA CH;-Co**FeSP + CoAS: + CO=CH;C(O)-SCoA + Co*FeSP 
synthase 


4 Nitrogenase N,+8H*+8e +16ATP=2NH; +H, +16ADP + 16P; 


5 Methyl-SCoM CH;-SCoM+CoBSH=CH,+CoMS-SCoB 
reductase 


P,, inorganic phosphate. 


‘Laboratoire de Cristallographie et Cristallogenése des Protéines, Institut de Biologie Structurale ‘J.P. Ebel’, CEA, CNRS, Université Joseph Fourier, 41 rue J. Horowitz, 38027 Grenoble Cedex 1, 


France. 
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Figure 1| Active sites of gas-processing metalloenzymes. a, [FeFe]- 
hydrogenase; b, [NiFe]-hydrogenase; c, nickel-dependent CO 
dehydrogenase; d, [FeMo]-nitrogenase; e, [Fe]-hydrogenase; f, methyl- 
SCoM reductase; g, acetyl-SCoA synthase; h, nitrogenase FeMo-co (top 
view). Open arrows and open spheres respectively indicate entrance/exit 
sites and available binding sites for gases and other substrates. We simplify 
the protein environments using the following symbols: d, hydrogen-bond 
donor; a, hydrogen-bond acceptor; and n, nucleophile®’. Black and red 
dashed lines respectively indicate putative hydrogen-bond and other 


unit, with n = 1 in [NiFe]-hydrogenase and n = 2 in [Fe]- and [FeFe]- 
enzymes. In addition, [NiFe]- and [FeFe]-hydrogenases have two CN" 
ligands that are terminally bound to one or two iron ions, respectively”, 
and [Fe]-hydrogenase contains a guanylylpyridinol cofactor that binds the 
active-site iron through an sp’-hybridized nitrogen and a formyl carbon 
atom’. The protein contributes only one thiolate ligand to iron coordina- 
tion. In [NiFe]-hydrogenases, protein coordination is mediated by two 
Fe-Ni-bridging and two nickel-binding terminal Cys thiolates''; in some 
enzymes, one of the latter Cys thiolates is replaced by a selenocysteinate””. 
In [FeFe]-hydrogenases, the active site is an [FeFe]-[Fe,S,] unit known as 
the H-cluster. The catalytic [FeFe] subcluster is coordinated by one Cys 
thiolate and a five-atom organic dithiolate ligand with a bridgehead atom 
that could be carbon, nitrogen or oxygen (X in Fig. 1a). Experimental 
studies favour nitrogen’, whereas a theoretical study using active-site 
models favoured an ether but did not rule out an amine”®. The crystal 
structures of the three hydrogenases show vacant ligand sites at the cata- 
lytic metal, which are potentially available for H, binding. In [FeFe]- and 
[Fe]-hydrogenases, substrate binding would complete an octahedral coor- 
dination sphere around the iron ion. In [NiFe]-hydrogenases, the catalytic 
nickel ion has both a terminal coordination site and a bridging coordina- 
tion site available for exogenous ligands, which, if occupied, would also 
result in an octahedral sphere around the metal ion. 

Nickel-dependent CODH” catalyses the reversible reduction of CO, to 
CO (Table 1, reaction 2). Its active site, known as the C-cluster, contains an 
[Fe,S,] unit linked to a spectroscopically unusual iron ion (Fel in Fig. 1c) 
and a nickel ion’*. The protein provides a terminal thiolate ligand to both 
the nickel and Fel and an additional His ligand to Fel. The structure of the 
catalytic A-cluster of ACS consists of a [NiNi]-[Fe,S,] unit’’. The nickel 
ion distal relative to [Fe,S,], Ni,, is bound to two consecutive main-chain 
nitrogen atoms and two Cys thiolates with square-planar geometry. These 
thiolates also bind the proximal nickel ion, Ni,, which is further coordi- 
nated by a Cys thiolate. The latter also binds one of the irons of the [Fe,S,] 
cluster. The proximal nickel ion is most likely involved in catalysis because 
it has two vacant coordination sites available for the binding of reaction 
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weak interactions. Atom colour codes: molybdenum, violet; selenium, 
orange; nickel, green; iron, brown; sulphur, yellow; oxygen, red; nitrogen, 
blue; carbon, white (or grey in the F,,, cofactor). Subscripts p and d 
respectively indicate the proximal and the distal metal ions with respect 
to the [Fe,S,] cluster. Iron ions in FeMo-co are numbered, and inorganic 
sulphur ions are labelled 1A, 2A, and so on. Fel in ¢ corresponds to a 
unique iron ion, and X in d corresponds to a central light atom that could 
be carbon, nitrogen or oxygen. Gp, guanylylpyridinol; Sec, selenocysteine; 
t-Gly, thioglycine. 


intermediates. Thus, the CODH, ACS and [NiFe]-hydrogenase active sites 
are similar: other than nickel, they contain iron, Cys thiolate ligands and 
two nickel cis-coordination sites available for substrate binding”. 

The reduction of N, to ammonia is performed by nitrogenases™ 
(Table 1, reaction 4). The reaction is carried out at a complex cluster, 
the [FeMo]-cofactor (FeMo-co)” (see page 839). The 1. 16-A-resolution 
crystal structure of the enzyme has redefined this cofactor as a [Mo-7Fe- 
9S-X] unit” (Fig. 1d, h). In the resting state of the enzyme observed in 
the crystals, the octahedral coordination sphere of the molybdenum ion 
is saturated by three bridging sulphide ions from the cluster, a bidentate 
homocitrate ligand and a His imidazole. The only other protein ligand 
is a Cys thiolate, bound to the apical iron ion at the other end of the 
cofactor. The other six iron ions are each bound by three sulphides and 
a central atom X, which, on the basis of the electron density and local 
charges, could be C*, N* or O* (ref. 24). There is no obvious coordina- 
tion site for either N, or H, (Table 1, reaction 4). 

Some Archaea use specific cofactors and coenzymes for methano- 
genesis (Table 1, reaction 5) and anaerobic methane oxidation. The 
active site of the enzyme involved in methane evolution, MCR, con- 
tains a porphyrin-related F,,, cofactor, also known as corphin. This is 
the most reduced tetrapyrrole-metal complex found in nature”. Apart 
from binding F.,;9, the nickel ion is axially coordinated by the oxygen of 
a Gln side chain. Trans to the latter, there is a vacant site available for 
substrate and product binding. 

As well as binding the generally buried active sites, the protein envi- 
ronment finely regulates its accessibility for all substrates, including 
protons, coenzymes and gases that diffuse through internal tunnels 
(Table 1). Some of these aspects are further developed below. 


Catalysis 

Possible catalytic mechanisms for the enzymes discussed here are pre- 
sented in Fig. 2. The [FeFe]- and [NiFe]-enzymes are ‘true hydrogenases, 
in that they catalyse the reversible oxidation of molecular hydrogen to 
protons and electrons’””’. However, the [Fe]-hydrogenase only catalyses 
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the first step in the uptake reaction (Table 1, reaction 1a), transferring 
the resultant hydride to methenyl-H,MPT" (Table 1, reaction 1d)”. 
This enzyme is also unusual in that both the active-site iron-binding 
guanylylpyridinol cofactor (Fig. le) and the substrate are found at the 
molecular surface*. The nickel-iron coordination geometry of [NiFe]- 
hydrogenases changes very little between its oxidized inactive and 
reduced active forms'’. A hydride probably bridges the nickel and iron 
ions in the active form”, replacing oxygen species observed in the inac- 
tive species. The competitive inhibitor CO binds terminally to nickel in 
[NiFe]-hydrogenase” and to Fe,, the distal iron ion (relative to [Fe,S,]), in 
[FeFe]-hydrogenase” (arrows in Fig. 1). In [FeFe]-hydrogenase, one of the 
intrinsic CO ligands switches between terminal and bridging positions, 
depending on the redox state of the enzyme and, possibly, on hydride 
binding to the same orbital’’ (Fig. 2c). So far, this switch has not been 
reproduced by calculations using active-site models”. 


The binding of active-site iron to the high-field, m acceptor CO and 
CN’ ligands (Fig. 1) promotes a low-redox state of the metal ion, mak- 
ing it a relatively soft Lewis acid. This, in turn, favours its binding to 
the soft Lewis base H’, a probable step in catalysis (Table 1). The direc- 
tion of the reaction in [FeFe]- and [NiFe]-hydrogenases is governed 
by the intracellular location of the enzyme, the pH and the H, con- 
centration. Previous work by some of us has focused on periplasmic 
hydrogen-uptake enzymes from sulphate-reducing bacteria’. Figure 2a 
shows two possible hydrogen-binding states of the active site of [NiFe]- 
hydrogenase: the partially oxidized electron paramagnetic resonance 
(EPR)-silent Ni-SI form, which may be the initial product of reductive 
activation (blue cycle); and the EPR-active Ni-C state, which has been 
shown to be in redox equilibrium with H, (red cycle)*™. The reaction of 
H, with Ni-C would result in a species, reduced by two more electrons, 
that should be a transition state in the catalytic cycle. An equivalent 
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Figure 2 | Enzymatic catalysis. a, [NiFe]-hydrogenase. Blue and red cycles 
define Ni-SI and Ni-C as H, acceptors, respectively*”’. b, Nitrogenase. 

The blue cycle corresponds to the complete process; it favours iron- 
centred catalysis”. The red cycle shows intermediate states found with a 
molybdenum-based model compound”; M is a metal component of FeMo- 
co. ¢, [FeFe]-hydrogenase. The bridgehead atom is depicted as nitrogen 
because it makes chemical sense and there are several lines of evidence that 
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favour it (see main text). d, Methyl-SCoM reductase. The proton donor to 
form CH, is depicted as Tyr*’. e, Acetyl-SCoA synthase. The two possible 
sequences of substrate binding are depicted by blue (CH, first) and red 
(CO first) arrows. f, CO dehydrogenase. This mechanism is based on the 
proposition in ref. 18 and recent experimental results*®”’. No catalytic 
mechanism for the re-interpreted active-site structure of [Fe]-hydrogenase® 
has been proposed. 1’ and 2’ correspond to alternative pathways. 
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Figure 3 | Enzyme folds and substrate tunnels. The structures shown 

are of the Moorella thermoacetica CODH-ACS complex (a), Azotobacter 
vinelandii nitrogenase (b), Methanothermobacter thermautotrophicus 
(formerly Methanobacterium thermoautotrophicum) MCR (c), 
Desulfovibrio desulfuricans [FeFe]-hydrogenase (d), Methanocaldococcus 
jannaschii [Fe]-hydrogenase (e) and Desulfovibrio fructosovorans |NiFe]- 
hydrogenase (f), corresponding to Protein Dank Bank depositions with 
codes 1OAO”, 1M1N”, IHBN”®, 1HFE™, 3F47 (ref. 8) and 1YQW”, 
respectively. Selected tunnel-forming cavities are coloured green, light- 
blue or yellow, depending on whether they are mainly hydrophobic, 
hydrophilic or amphiphilic, respectively. The probe radii used to display 
these cavities are 0.8 A for CODH-ACS and [NiFe]-hydrogenase, 1.4 A 
for MCR, 0.6 A for [FeFe] -hydrogenase and 1.0 A for [FeMo] -nitrogenase. 
Arrows indicate proposed main gas entrance/exit sites at the enzyme 
surface. Crystallographically detected xenon sites are shown as large dark- 
blue spheres, and CO,-binding sites (in CODH-ACS) are shown as small 


situation may apply to [FeFe]-hydrogenases because the H,,, 1» state, 
which is postulated to bind H, (ref. 14) (Fig. 2c), is the only catalytically 
relevant observable EPR-active species. 

In the hyperthermophilic bacterium Carboxydothermus hydrogeno- 
formans, which contains five CODHs”, CO is used both as a carbon and 
as a reducing power source (Table 1, reaction 2). It has been postulated 
that the enzyme in the CO,-binding C,,,, state contains a Ni* ion and 
a bound Fel-Ni-bridging hydride’ (Fig. 2f). The nickel coordination 
seems to be square planar in most of the reaction intermediates, with the 
exception of the CO-bound species, in which it is tetrahedral. The prop- 
osition that the C,,,, state binds OH’ between nickel and Fel (ref. 18) has 
been confirmed by X-ray structures’. CODH can form a complex with 
ACS that, in acetogens, functions in the anaerobic Wood-Ljungdahl 
pathway of CO, fixation (Table 1, reactions 2 and 3)*; in methanogens 
growing on acetate, it catalyses the reverse reactions as part of the acetyl- 
SCoA decarbonylase/synthase complex”. In the bifunctional CODH- 
ACS, CO produced at the C-cluster diffuses through a tunnel and binds 
at the A-cluster (Fig. 3). 

Although the net electronic change during catalysis is zero, two elec- 
trons are required for the oxidative addition of CH," to the proximal 
nickel site of the A-cluster (Table 1, reaction 3; Fig. 2e). These electrons 
could be temporarily donated through the oxidation of Ni,’ to Ni,” or 
Ni,[Fe,S,]* to Ni, “[Fe,S,]”*, or, less likely, the formation ofa disulphide 
bond from two thiolate ligands at the active site. Methylation of Ni’ has 
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black and red spheres for C and O, respectively. The xenon site in [FeFe]- 
hydrogenase and the CO, sites in CODH-ACS correspond to our own 
recent results. Other atoms are coloured as in Fig. 1. For the enzymes with 
internal two-fold symmetry (CODH, ACS, MCR, [Fe]-hydrogenase and 
[FeMo]-nitrogenase), one half is depicted in grey, showing 

secondary structure elements as ribbons (helices) and arrows (}-sheets). 
All other subunits are shown as Ca traces, coloured according to domain 
and/or subunit. The arrows labelled P and M indicate the P-cluster and 
the FeMo-co in nitrogenase, respectively. All structures are shown on the 
same scale except CoBSH and CoMSH, which are magnified by a factor of 
two for clarity. Subunits are labelled using Greek symbols for CODH-ACS, 
MCR and nitrogenase. Domains are labelled with numbers. For the three 
hydrogenases, S and L respectively indicate small and large 

subunits, H and F respectively indicate H-cluster and ferredoxin-like 
domains, and N and C respectively indicate amino- and carboxy-terminal 
domains. 


been shown in a model compound lacking an [Fe,S,] cluster”. How- 
ever, calculations using density functional theory indicate that addi- 
tion of [Fe,S,]** toa truncated Ni,""Ni,’ model leads to Ni,'Ni,"[Fe,S,]* 
(refs 40, 41), a state recently observed using Mossbauer spectroscopy”. 
The order of CO and CH, binding to the A-cluster is still the subject of 
considerable debate*’. Figure 2e depicts the two possibilities. Another 
mechanism” involves the paramagnetic OC-Ni,'[Fe,S,]”* state; how- 
ever, the oxidative addition of CH; would require a second electron, 
the source of which remains undefined. After being synthesized by an 
insertion/migration reaction, the nascent acetyl group binds to CoAS , 
regenerating the initial two-electrons-more-reduced active-site state by 
reductive elimination. 

Regardless of their carbon source, all methanogens evolve CH, 
through a mechanism catalysed by MCR (Table 1, reaction 5; Figs 1 
and 2d). It has been shown that in the functional active enzyme the 
nickel ion bound to F,3y is in the +1 redox state** (Fig. 1). Accord- 
ing to the structure of the Methanocaldococcus jannaschii enzyme, 
methyl-SCoM must bind before the second substrate, CoBSH (7-thio- 
heptanoyl-threonine phosphate), because the binding of CoBSH 
completely blocks access to the nickel ion. Although theoretical cal- 
culations have proposed a mechanism whereby CH, is liberated upon 
CH,-SCoM binding to nickel” with the transient formation of a Ni- 
SCoM complex, direct evidence of the formation ofa stable Ni**-CH, 
adduct*”** favours the mechanism depicted in Fig. 2d. The overall 
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reaction implies the formation of a C-H bond on the methyl group 
to yield CH, and CoMS-SCoB (Table 1, reaction 5; Fig. 2d). Release 
of CH, from the enzyme possibly occurs only after CoMS-SCoB dis- 
sociation’’. The trans—cis transition of an unusual thio-oxopeptide 
bond (Fig. 1) has been postulated to have a negative allosteric effect 
on the other active site in the MCR dimer”. 

Perhaps the most complex biological reaction involving gases is the 
reduction of N, to NH, (Table 1, reaction 4). Although the crystal struc- 
ture of nitrogenase and its FeMo-co (Fig. 1) has been available for more 
than 15 years”, there is still considerable debate concerning the catalytic 
site’. In its resting state, the cofactor does not bind N, and the molybde- 
num ion does not have a vacant coordination site. The substrate binding 
site(s) at this ion may become available after reduction if some of the 
metal-ligand bonds are broken (for example the molybdenum-homo- 
citrate bonds), but there is no structural evidence for this. Researchers 
favouring iron-based catalysis point out, for instance, site-directed muta- 
genesis, and crystallographic and spectroscopic studies using substrates 
other than N, (ref. 50), which indicate that an FeS face of FEMo-co could 
be involved in substrate binding. They also argue that related nitrogenases 
in which molybdenum has been replaced by either vanadium or iron are 
still active, although a substitute iron ion, in particular, should not be 
able to reproduce the molybdenum-based chemistry proposed in ref. 51. 
Intriguingly, H, formation and its subsequent evolution is a requirement 
for N, binding (Table 1, reaction 4; Fig. 2b). Calculations using density 
functional theory suggest that proton binding to the FeMo-co does not 
involve high activation barriers”. The discovery of an interstitial light 
atom in the FeMo-co has attracted much attention”. Several studies sug- 
gest that nitrogen is the most likely candidate”, either having only a struc- 
tural role, stabilizing the FeEMo-co””’, or also participating in catalysis”. 
Figure 2b shows molybdenum- and iron-based mechanisms. 


Crystal structures and gas tunnels 

Figure 3 shows the three-dimensional structures of the anaerobic gas- 
metabolizing enzymes discussed in this Review. They span a wide range 
in complexity, from relatively small monomeric and heterodimeric 
[FeFe] -hydrogenases™, homodimeric [Fe]-hydrogenase® and hetero- 
dimeric [NiFe]-hydrogenases” to larger homodimeric CODHs”™, an 
(a), dimer of heterodimers in nitrogenase (where a and f respectively 
correspond to the NifD and NifK subunits”), and dimers of heterotrim- 
ers — (aBy), — in the case of nickel-dependent MCR”. ACS can form 
an (a,8,) complex with CODH””*. Nitrogenase, MCR and CODH have 
in common the presence of pseudo two-fold axes that lie perpendicular 
to proper two-fold axes. These pseudo-symmetry axes relate respective 
a and B subunits in nitrogenase and MCR, and domains {3 and 64 in 
CODH. Metal sites such as the nitrogenase [8Fe—7S] P-cluster™ (Fig. 3) 
and the CODH C-cluster sit on the pseudo two-fold axes, suggesting that 
these complex structures evolved from simpler ones by gene duplication 
and divergent evolution. Indeed, most active sites are located at or close 
to subunit and/or domain interfaces (Fig. 3): the nickel-containing F,;, 
cofactor lies between subunits a, B, y and a’ of MCR (Fig. 1); the FeMo- 
co lies between three domains of the nitrogenase a-subunit; the ACS 
A-cluster lies close to the interface of its three domains; the H-cluster 
of [FeFe]-hydrogenase is bound to a region of the protein facing the 
ferredoxin-like domain; and the iron—guanylylpyridinol cofactor of [Fe]- 
hydrogenase is bound at the surface of the N-terminal domain facing 
the dimer-forming C terminus. An exception is the Ni-Fe site in [NiFe]- 
hydrogenase, which is buried in its large subunit. 

Apart from [Fe]-hydrogenase, the studied enzymes have tunnels to 
transfer gaseous substrates and/or products to and from the active sites. 
In [NiFe]-hydrogenase, [FeFe]-hydrogenase, CODH and ACS, these 
tunnels are mainly hydrophobic. Experiments with xenon-pressurized 
crystals'”*”*’°* have shown that this gas binds in the observed tunnels 
even though most of them appear too narrow to allow xenon diffusion. 
This, in turn, means that, even in the crystal, significant protein motions 
can occur. 

Molecular dynamics studies of intramolecular gas diffusion have been 
reported for [NiFe]- and [FeFe]-hydrogenases. Initial results obtained 
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for [NiFe]-hydrogenases suggested that H, does not diffuse randomly 
in the protein matrix, but instead uses the crystallographically charac- 
terized tunnels”. In addition, large sections of the tunnels were pos- 
tulated to be gas reservoirs because of the low concentration of H, in 
natural settings and the higher solubility of this gas in a hydrophobic 
medium. In the [NiFe]-hydrogenase study”, several gas molecules were 
allowed to diffuse from the observed xenon sites, and in some trajec- 
tories they reached the active site. These results were complemented 
later by a theoretical study in which an in silico Val-to-Ala mutation 
in the tunnel near the active site locally increased H, diffusion from 
the solvent”. However, electrochemical experiments showed that the 
[NiFe]-hydrogenase catalytic rate for hydrogen oxidation matches 
that of platinum, indicating that the protein tunnels are not a barrier 
to H, diffusion”. Experimentally, mutants with the Val mentioned 
above replaced by Met or Ile, which are bulkier, displayed significantly 
reduced CO and H, diffusion rates*. Molecular dynamics studies of H, 
and O, diffusion in [FeFe]-hydrogenase suggested two major pathways 
for these gases. One coincides with a narrow hydrophobic tunnel that 
was previously characterized crystallographically™ and found to bind 
one xenon atom (Fig. 3). The other tunnel depends on the dynamics of 
the protein matrix. 

Nitrogenase contains a hydrophilic tunnel extending from the enzyme 
surface to the FeMo-co. A potential role for this tunnel in N, and/or H, 
diffusion is suggested by the presence of one xenon site next to it, close 
to the active site®. Because N, reduction to ammonia is slow (Table 1, 
reaction 4), a hydrophobic tunnel that would allow rapid gas access to 
the FeMo-co has not evolved. Another possible use for the hydrophilic 
internal pathway in nitrogenase™ could be the evacuation of product as 
NH,". In MCR, the active site is accessible only through an amphiphilic 
tunnel located at the intersection of four subunits. The tunnel becomes 
completely obstructed after successive binding of CH;-SCoM and 
CoBSH. In the crystal structure with both CoMSH and CoBSH bound, 
a weak electron density peak, in a hydrophobic pocket between the two 
coenzymes, was tentatively interpreted as trapped CH, (ref. 45). 

Available structures of ACS show that another way in which the pro- 
tein may guide substrate access to the active site is through domain 
rearrangements that modify the tunnel network. In the ACS closed 
conformation’’”*, a continuous 70-A-long tunnel connects the C- and 
A-clusters such that CO produced at the C-cluster can diffuse and be 
used at the A-cluster without being released to the medium. The tunnel 
connection is interrupted in the ACS open conformation”, in which the 
A-cluster presumably becomes accessible to methyl transfer from the 
corrinoid iron-sulphur protein (Table 1, reaction 3). 

In conclusion, cavity searches on three-dimensional structures and 
xenon-binding studies reveal at least some of the tunnels used for gas 
transport. Molecular dynamics simulations have shown that the size of 
these crystallographically observed cavities may vary owing to protein 
movements and that these movements may generate transient cavities 
that constitute additional pathways for gas diffusion. 


Active-site biosyntheses 

The complex active sites discussed here (Fig. 1) are either pre-assembled 
in accessory scaffold proteins before being inserted or progressively 
built directly in the target apoenzyme. Understanding these maturation 
processes (Fig. 4) may be useful in explaining their evolution from a 
prebiotic setting and the overexpression of enzymes for potential bio- 
technological applications. 

The extensively studied maturation of [NiFe]-hydrogenase 3 from 
Escherichia coli’ involves at least the eight gene products from the hyp 
operon. These proteins are not only species specific, but in E. coli differ- 
ent hydrogenases have their own maturation machinery. The complete 
active site is assembled directly in the HycE apohydrogenase (Fig. 4). The 
[FeFe]-hydrogenase active site is also buried in the protein core (Fig. 3). 
The [FeFe] subcluster is covalently bound to the protein by only one Cys 
thiolate (Fig. 1). So far, only three gene products, HydE, HydF and HydG, 
have been identified as being required for expressing an active [FeFe]- 
hydrogenase in E. coli that lack this enzyme™ (Fig. 4). Nothing is known 
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Figure 4 | Active-site biosyntheses. a, [NiFe]-hydrogenase. Biosynthesis 
comprises carbamoyl phosphate synthesis by CarAB from L-Gln and 
bicarbonate’, carbamoyl phosphate ATP-dependent conversion to 
AMP-OC(O)NH, by Hypk, transfer of the carbamoyl moiety to the HypE 
C-terminal Cys, ATP-dependent dehydration of the formed thiocarbamate 
to thiocyanate” and CN’ transfer (triangles for 1CN’, diamond for 2CN™ 
ligands) to a HypCD complex containing the iron ion already bound to 
HypC”*. An equivalent pathway for CO synthesis has not been defined, 
but externally supplied CO can also be incorporated, presumably” before 
transfer of the Fe(CN),CO unit (double diamond) from HypC to HycE. 
Subsequent nickel insertion is mediated by HypA (proposed nickel carrier), 
HypB (GTP-dependent nickel insertase) and SlyD (affinity modulator of 
HypB for nickel”). Finally, HycI cleaves the C-terminal extension of HycE 
and the nickel-based nascent C terminus rearranges, burying the active 
site in the protein core (double square)”". b, [FeFe]-hydrogenase. HydE 
and HydG are radical S-adenosyl-t-methionine (SAM) proteins”. HydG 
resembles ThiH, which is involved in thiazole synthesis and also produces 
p-cresol and dehydroglycine from Tyr. Dehydroglycine could be used to 


about the synthesis of CO and guanylylpyridinol or iron incorporation in 
[Fe]-hydrogenase (Hmd) (Fig. 1). However, in all the known methanogen 
genomes that contain the hmd gene, there is an adjacent gene, which we 
call hmdB, coding for a radical S-adenosyl-L-methionine (SAM) pro- 
tein. Although hmaB has been annotated as biotin synthase, amino-acid 
sequence analysis indicates that this is not the case. HmdB could be 
involved in either CO or guanylylpyridinol synthesis. 

Active nitrogenase corresponds to the NifDK (a,f,) heterotetramer 
transiently associated with the iron protein NifH® (Fig. 4). The highly 
complex FeMo-co is only bound to two amino acids from the protein 
matrix (Fig. 1) and can be easily extracted from it®. Its biosynthetic 
pathway involves ten proteins (NifS, NifU, NifB, NifX, NifE, NifN, NifV, 
NifQ, NifH and NifY) encoded by the nifoperon. These proteins can be 
functionally divided into molecular scaffolds, metallocluster carriers 
and enzymes. Although it is possible to assemble a functional FeMo-co 
in vitro using only iron, sulphur, molybdenum, R-homocitrate and the 
proteins NifB, NifH and NifEN, as well as ATP and SAM, the addition 
of the other Nif proteins doubles the yield of active enzyme™. 


ATP Molybdenum 


synthesize the small [FeFe]-bridging dithiolate molecule (diamond)”’. 

The function of HydE is unknown, but it can bind thiocyanate and small 
organic molecules containing at least two carboxylate moieties at different 
sites”. The products of HydE and HydG (double diamond) bind to HydF, a 
presumed scaffold protein exhibiting GTPase activity that transfers the pre- 
assembled Fe,(S-C-X-C-S)(CN),(CO), unit next to the [Fe,S,] subcluster 
(double diamond), producing active [FeFe] -hydrogenase”’. ¢, Nitrogenase 
(NifDK). The Cys desulphurase NifS directs the assembly of [FeS] clusters 
(diamonds) on the scaffold NifU®. These clusters serve as substrates for 
the radical SAM protein NifB to synthesize the NifB cofactor (double 
diamonds), of unknown structure, on its C-terminal domain, which is 
homologous to NifX°. NifX carries the cofactor to the central scaffold 
protein NifEN (triple square), which is homologous to NifDK*. Then 
R-homocitrate and molybdenum are transferred to NifEN by NifV”’ and 
NifQ, respectively”. Molybdenum delivery and insertion seem to involve 
ATP hydrolysis and NifH, which resembles in this respect HypB and HydF. 
Finally, the carrier protein NafY (also known as NifY) transfers the pre- 
assembled FeMo-co (quadruple square) to NifDK. 


Very little is known about the synthesis of the C-cluster of CODH. It 
is thought that its FeS cuboidal part (Fig. 1) is assembled in the enzyme 
before nickel insertion”, this insertion being mediated by the proteins 
CooC, CooT and CooJ””’. CooT shows low amino-acid sequence 
homology to HypC (Fig. 4). CooJ has a His-rich, nickel-binding motif 
at its C terminus, and CooC, which is responsible for nickel inser- 
tion, is similar to HypB and has low ATPase activity. No details have 
been reported concerning the synthesis of the ACS A-cluster. Also, the 
enzymes specific to the synthesis of the MCR Ni-F,,;) cofactor and its 
machinery for nickel insertion are unknown. However, four amino-acid 
side chains are modified by specific methyltransferases”””*. Furthermore, 
a Gly peptide bond is converted into a thio-oxopeptide bond”. 

Non-iron metal insertion may occur through a common mechanism 
in nitrogenase, [NiFe]-hydrogenase and CODH: NifH and HypB are 
structurally similar, and CooC has significant amino-acid sequence 
homology to them”’*”. They couple metal insertion to the hydrolysis 
of either GTP or ATP. In the three cases, the non-iron metal (nickel or 
molybdenum) is bound to one or more amino acids of the target protein 
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Figure 5 | Model compounds. a, [FeFe]-hydrogenase active-site model, [Fe,(adt) 
(CO),(dppv),(t-H)]* (ref. 15). b, [NiFe]-hydrogenase active-site biomimetic model, [(CN),(CO) 
Fe(u-tpdt)Ni(S,CNEt,)] 7” ¢, Functional model of hydrogenase, [Ni(P®,N*,),]**. R, phosphine 

or cymene; R’, phosphine or benzene”’. Depending on the substituent on the amine nitrogen 

and on the phosphorus atoms, these complexes are electrocatalysts for hydrogen production 
(turnover frequency, 350 s') or for hydrogen oxidation (turnover frequency, 10s”). d, [Fe]- 
hydrogenase active-site model, [Fe(CO),(PPh,)I(hmp)]”*. This monomeric iron bis(carbonyl) 
pyridonate complex provided further evidence that the iron is divalent in the enzyme’. e, Structure 
of [HIPTN,N]Mo(N,). This molybdenum compound is not biologically relevant, but it can 
reduce N, with electrons and protons to ammonia and hydrogen” '_R”, HIPT. f, ACS active-site 
model, [Ni(dppe)Ni(PhPepS)]”, showing a labile nickel with changing coordination geometry, 
possible reduction to Ni* and CO binding with an infrared frequency very close to that observed 
in the OC-Ni,” [Fe,S,]** state of the enzyme. g, CODH active-site model, [(tdt)NiFe,S,(LS,)]*, 
with a distorted square-planar diamagnetic Ni* coordination and a possible insertion of an 

‘exo’ iron by substitution”. h, Mechanistic interpretation of the photoinduced formation of 
methane and spirodisulphide by bis{1-[2-(methylthio)ethyl]cyclohexanethiolato}Ni”™ via a cyclic 
sulphuranyl radical, without true catalytic turnover’. adt, 2-azapropane-1,3-dithiolate; dppe, 
diphenylphosphinoethane; dppy, cis-1,2-bis(diphenylphosphino)ethylene; Et, ethyl; t-H, terminal 
hydride; HIPT, hexa-isopropylterphenyl; hmp, deprotonated 2-hydroxy-6-methylpyridine; 

LS,, 1,3,5-tris((4,6-dimethyl-3-mercaptophenyl)thio)-2,4,6-tris(p-tolylthio)benzene(3-); Ph, 
phenyl; PhPepS, N,N’-phenylenebis(o-mercaptobenzamide); PPh,, diphenylphosphine; PPh,, 
triphenylphosphine; tdt, toluene-3,4-dithiolate; tpdt, 3-thiapentanedithiolate. 


reaction intermediates. In the bio-inspired approach, 
the functional model is not structurally similar to 
the active site, but the reaction is the same. In gen- 
eral, the biomimetic compounds so far reported 
are not catalytically active, either because they are 
not identical to the biological active site or because 
they do not mimic the protein environment and the 
second coordination sphere. Synthetic catalysts are 
often insoluble in aqueous solvents, in which case 
the catalysis is performed in organic solvents. This 
does not have to be a drawback, because non-polar 
solvents may be better mimics of the protein matrix 
than water, although the protein interior is a mosaic 
of charged, polar and non-polar components, which 
is difficult to mimic in the laboratory. As illustrated 
by FeMo-co in nitrogenase”, extracted active sites 
may be stable but become inactive, underscoring the 
importance of the protein. 

Figure 5 shows selected model compounds. There 
is a vested interest in the synthesis of mimics of 
hydrogenases based on nickel and/or iron instead of 
platinum, because the availability of platinum will 
not satisfy the demands of a hydrogen economy. An 
interesting recently reported compound related to 
[FeFe]-hydrogenase is a complex of two iron ions, 
bridged by CO and an azadithiolate (Fig. 5a), that 
shows the coupling of amine protonation by weak 
acids to hydride formation. This result favours 
nitrogen as bridgehead atom in the small dithiolate- 
containing molecule of the [FeFe] subcluster”. 
Although the first thiolate-bridged Ni-Fe carbonyl 
complex was synthesized in 1996 (ref. 76) the closest 
mimic of the [NiFe]-hydrogenase yet obtained was 
reported very recently (Fig. 5b). The infrared fre- 
quency bands from its CO(CN), unit are very close 
to those from the enzyme”. However, the binuclear 
Ni-Fe synthetic models are not functional. One of the 
best catalytically active models is the mononuclear 
nickel complex [Ni(P*,N™,),]”*, in which pendant 
bases are incorporated in the second coordination 
sphere” (Fig. 5c). Concerning [Fe]-hydrogenase, an 
iridium-containing mimic of guanylylpyridinol has 
proved to be an excellent catalyst for dehydrogenation 
reactions. It was also very stable relative to complexes 
in which the guanylylpyridinol mimic lacks the 2-OH 
group present in its biological counterpart”. An iron- 


and is inserted at a late stage of the assembly process. Active-site assem- 
bly in [FeFe]-hydrogenase does not involve non-iron ion insertion and 
proceeds by a different mechanism. Both the catalytic [FeFe] subcluster 
of [FeFe]-hydrogenase and the FeMo-co of nitrogenase are fully synthe- 
sized in a scaffold protein (HydF or NifEN), possibly located at the core 
of a multi-enzymatic complex”, before their insertion into the respective 
enzymes. The transfer of the catalytic centres as fully assembled units, 
along with the paucity of protein ligands and their iron/sulphur content, 
suggest a mineral origin. Interestingly, for both enzymes radical SAM 
proteins are involved at key steps in the active-site synthesis (Fig. 4). The 
same may apply to [Fe]-hydrogenase, underscoring the importance of 
radical chemistry at the early stages of life evolution. 


Model compounds 

The synthesis of metalloenzyme active-site analogues is both a chemi- 
cal challenge and a way to better understand the properties of their 
different components. Model compounds also shed light on the role 
of the protein environment in modulating the enzymatic properties. 
Depending on the desired goal, chemists use either biomimetic or bio- 
inspired approaches. Biomimetic approaches consist of the synthesis 
of both structural analogues of the enzyme active site and plausible 
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containing model is depicted in Fig. 5d. 

The development of new methods to assemble iron-sulphur-thiolate 
and iron-molybdenum-thiolate clusters of high nuclearity led to the 
isolation of a cluster displaying the nitrogenase FeMo-co topology with 
sulphide rather than a lighter atom at its centre*’. Di-iron complexes 
capable of cleaving the nitrogen-nitrogen bond of hydrazine with excel- 
lent catalytic activity support the possibility that di-iron centres have a 
role in biological N, reduction®’. Some molybdenum complexes are 
also efficient for the production of ammonia and are capable of reduc- 
ing nitrogen, albeit with limited catalytic turnover numbers” (Fig. 5e). 
Carbon monoxide binding is very well characterized in ACS models. 
The synthesis of complexes with three coplanar Ni* centres is interest- 
ing because of the resemblance of the central nickel ion to Ni, (ref. 82). 
A dinuclear nickel model sharing features with the ACS active site is 
shown in Fig. 5f. Although several complexes have been synthesized 
that involve an [Fe,S,] cluster leading to the formation of an [Fe,S,]- 
([o- SR)-Ni,”* unit, none of them contains a binuclear fragment”. The 
two limiting steps in the synthesis of CODH models are the construc- 
tion ofa planar diamagnetic Ni* in an iron—sulphur cluster and the sta- 
bilization of an ‘exo iron (Fe1) bridged to the larger core fragment. A 
[(tdt)NiFe;S,(LS,)]* cluster has been synthesized for the future insertion 
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of an ‘exo iron by substitution™ (Fig. 5g). The cleavage of the carbon- 
sulphur bond of methyl-SCoM in MCR poses a difficult chemical prob- 
lem. The first example of biomimetic production of methane and a mixed 
disulphide by catalysis with a Ni** complex was reported in 1997 (ref. 85). 
An interesting catalytic model was obtained through the synthesis of a 
substrate containing an aliphatic thiol and a methyl thioether function 
(Fig. 5h). 

In conclusion, chemical models of enzyme active sites are useful for 
understanding how ligands modify the properties of metal ions, the 
influence of the second coordination sphere and the effect of the protein 
matrix. As a general rule, it seems that bio-inspired rather than bio- 
mimetic compounds may, eventually, find technological applications. 


Perspective 

Now that most metal-containing enzyme active sites from anaerobic 
organisms have been structurally characterized in at least one state, one 
of the future challenges will be to obtain crystal structures of reaction 
intermediates. For example, a nitrogenase-N, complex or a methylated 
and/or carbonylated form of ACS would shed considerable light on the 
respective enzymatic mechanisms. Hydrogenases pose especially dif- 
ficult problems for the crystallographer because substrate and product 
are invisible at the resolution typically obtained from protein crystals. 
A similar general problem is the discrimination between light atoms 
such as carbon, nitrogen and oxygen. In such cases, as in many oth- 
ers, spectroscopic, theoretical and synthetic approaches have been, 
and will continue to be, essential. Another aspect that will have to be 
further developed is our understanding of the biosynthesis of the com- 
plex active sites discussed here. Even in the best-studied cases, there 
remain many points to clarify. A fascinating aspect of the active-site 
assembly of nitrogenase and [FeFe]-hydrogenase, and most probably 
[Fe]-hydrogenase, is the underlying radical chemistry that leads to the 
synthesis of some of its components. Understanding how radical SAM 
enzymes carry out these reactions will be of paramount importance. 
As we continue to advance our knowledge concerning these ancient 
gas-processing enzymes, we will also be looking back into the distant 
past to when life first arose on this planet. a 


Note added in proof: A recent model of a closed form of [Fe]-hydrog- 
enase'” suggests that a 10-A-long hydrophobic channel connecting the 
active-site cavity to the protein surface might be used by H,. 
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Metalloproteins and metal sensing 


Kevin J. Waldron’, Julian C. Rutherford’, Dianne Ford' & Nigel J. Robinson’ 


Almost half of all enzymes must associate with a particular metal to function. An ambition is to understand 

why each metal-protein partnership arose and how it is maintained. Metal availability provides part of the 
explanation, and has changed over geological time and varies between habitats but is held within vital limits 

in cells. Such homeostasis needs metal sensors, and there is an ongoing search to discover the metal-sensing 
mechanisms. For metalloproteins to acquire the right metals, metal sensors must correctly distinguish between 


the inorganic elements. 


The physical and chemical properties of a selected metal satisfy a pro- 
tein’s need to form structure, as for zinc-fingers, or to drive catalysis 
(Box 1). Proteins have also evolved to use those metals that are, or at 
least once were, most accessible’. A tenet of the cell biology of metals 
is that some metals tend to bind organic molecules more avidly than 
others. The natural order of stability for divalent metals, often called 
the Irving- Williams series’, sets out a resulting trend with copper and 
zinc forming the tightest complexes, then nickel and cobalt, followed 
by ferrous iron and manganese and finally, forming the weakest com- 
plexes, calcium and magnesium (Box 1). Biology must overcome this 
trend for some proteins to bind uncompetitive metals and for other 
proteins in the same cell to bind competitive metals. Cells are not ideal 
solutions and kinetic factors can dominate the distribution of metals, for 
example where metals are delivered by metallochaperones. Nonetheless, 
excluding the wrong metals from proteins may be more challenging 
than acquiring the right ones. 

A truism is that there has only been selection for most metalloproteins 
to obtain the correct element in the context of a cellular environment, with 
implications for the metal status of proteins expressed in heterologous 
cells and in vitro. Two proteins, one copper protein and one manganese 
protein in the periplasm of a cyanobacterium, illustrate this observation’. 
The folding compartment for the manganese protein MncA overrides its 
binding preference to control its metal content. The two proteins share 
similar cupin folds and the same protein ligands. Both prefer copper to 
manganese in vitro, consistent with the Irving- Williams series, but the 
manganese protein folds before export by means of the Tat translocase. 
It therefore traps manganese in the cytosol rather than the periplasm. A 
keystone of this article will be that metal availability is controlled in the 
cytoplasm**. Metal importers, metal exporters and metal stores maintain 
a limited supply of competitive metals such as copper. Thus, proteins 
compete with other molecules for these metals rather than metals com- 
peting with other metals for proteins. To achieve this state, the amount 
of each metal is somehow sensed to adjust the actions of transporters (at 
plasma membranes or internal compartments) and storage proteins for 
each element”. Sensors also adjust metabolism to minimize demand for 
scarce metals or to exploit abundant ones. These sensors are thus pivotal 
to metal selectivity because their capacity to distinguish the correct metal 
affects metal occupancy of other metalloproteins. 

Here we summarize the use and acquisition of metals by proteins, 
and then give an overview of how cells sense metals. Multiple bacterial 
metal-binding, metal-sensing transcriptional regulators are known and 
have recently been reviewed elsewhere’. Metal specificity of bacterial 
metal sensors is determined by affinity, allostery and access’. Here these 


three concepts are used to question where metal discrimination lies in 
eukaryotic metal sensing. In yeast, some metal-responsive transcriptional 
regulators have been discovered, but their molecular mechanisms may 
harbour surprises. In mammals, one metal-binding, metal-sensing tran- 
scriptional regulator and a post-transcriptional regulator are known’, 
but innumerable metal-responsive events, including elaborate switches in 


protein trafficking, are controlled by largely uncharted metal sensors. 


An appraisal of metals used in proteins 
The time is ripe to survey the use of metals in proteins, by exploiting 
bioinformatics and proteomics. Non-denaturing conditions must be 
used in proteomics if metal cofactors are to be retained”, and this can 
be technically challenging, there being a potential for metal exchange, 
acquisition or loss during fractionation. A commonly cited approxima- 
tion is that one-third of proteins require metals. A systematic bioinfor- 
matics survey of 1,371 different enzymes for which three-dimensional 
structures are known estimated that 47% required metals, with 41% 
containing metals at their catalytic centres”’. It is noteworthy that com- 
plicated metal centres“* can remain poorly defined even after structure 
determination. Metalloenzymes occur in all six Enzyme Commission 
classes, accounting for 44% of oxidoreductases, 40% of transferases, 39% 
of hydrolases, 36% of lyases, 36% of isomerases and 59% of ligases’’. 
Magnesium is the most prevalent metal in metalloenzymes, although 
it is often involved in loose partnerships with phosphate-containing 
substrates such as ATP and is sometimes interchangeable with manga- 
nese. The frequency of manganese in protein structures may overesti- 
mate its use in vivo, where magnesium is truly the cofactor (Fig. 1a). 
A catalogue of the principal type of enzyme that uses each metal 
reveals that iron (81%), copper (93%) and molybdenum plus tung- 
sten (81%) are most commonly used as conduits for electrons in oxi- 
doreductases’* (Fig. 1b). Cobalt and molybdenum are found almost 
exclusively in association with cofactors in vitamin-B,,-dependent and 
molybdopterin-dependent enzymes (see page 839). The proportion of 
all proteins, not just enzymes, using metals is expected to be less than 
47% and the relative contributions of the different metals may alter as 
the metals that perform structural roles, such as zinc in zinc-fingers, are 
more fully accounted for. 


Evolving metal availability and use 

The metals available to organisms have altered during evolution as spe- 
cies have occupied new niches and in response to massive environmen- 
tal changes, for example the liberation of dioxygen by photosynthesis, 
which significantly changed metal solubility’ *. Crucial enzymes, which 
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Box 1| The selection of metals 


At catalytic centres, metals increase acidity, electrophilicity and/or 
nucleophilicity of reacting species, promote heterolysis, or receive 
and donate electrons”. The protein's primary and secondary metal- 
coordination spheres tune the properties of the metal to optimize 
reactivity and influence metal selection. Donor ligands (S, O or N) can 
impart bias in favour of the correct metal. The metal-binding pocket 
can exclude ions with the wrong charge. Coordination geometry 
(octahedral, tetrahedral, square pyramidal, trigonal bipyramidal, square 
planar, trigonal or linear) can impart bias either in folded apoproteins 
if the preformed site is rigid or during folding if favourable energetics 
is coupled to the correct geometry. The figure shows the dominant 
geometries of four divalent metals in proteins”. 


However, because proteins have flexibility, steric selection between 
metals is imperfect, especially in nascent polypeptides. Under these 
conditions, the relative affinities of metals for proteins are significantly 
governed by the ligand field stabilization energies of the metals 
themselves. This creates the universal orders of preference, which for 
divalent metals is the Irving-Williams series. There is ambiguity about 
the position of zinc, which is either at the top of the series or somewhere 


above cobalt. This ambiguity is attributed to the nephelauxetic effect”. 
Cuprous ions, expected to dominate in more reducing cell environments, 
are also competitive, and some exceptionally tight ferric complexes are 
known. Crucially, such affinity series underpin calculations that each 
metal's relative abundance in the biological locality is paramount in 
governing selective metal binding by proteins’, highlighting the vital 
contribution of cell biology to the selection of metals by metalloproteins. 


cannot be readily replaced, have created dependences on metal spe- 
cies that were once plentiful but are now scarce. For example, under 
anoxic conditions iron is soluble in ferrous form, but in the presence 
of dioxygen ferric iron is poorly soluble; hence, plants and microbes 
release siderophores and plants express root-surface metal reductases 
to scavenge meagre amounts of iron from the environment’*”*. Despite 
these efforts, plants remain a poor dietary source of iron and, in conse- 
quence, iron deficiency is the most common human dietary deficiency 
worldwide and one of the ten most prevalent causes of disease’. 

In today’s open oceans, the poor availability of iron, but the presence 
of some copper, has led organisms to replace the former and exploit the 
latter’”"*. The oceanic diatom Thalassiosira oceanica has switched from 
using iron-containing cytochrome c, to using copper-containing plas- 
tocyanin’””’, which was otherwise only known in some cyanobacteria 
and photosynthetic organisms containing chlorophyll b. The sulphur 
isotope record also implies that ancient oceans were dominated by sul- 
phide and, hence, metal sulphides’. Under these conditions, copper, zinc 
and cadmium were poorly available, but cobalt and nickel more avail- 
able’. Greater volcanic-nickel input also preceded this era’. Enzymes 
that use cobalt and nickel are considered to be metal relics from these 
times’*””°. Some carbonic anhydrases are cambialistic, capable of sub- 
stituting cobalt for zinc in ocean surface waters where zinc is in short 
supply”. Even more noteworthy are a subset of structurally modified 
carbonic anhydrases from marine diatoms that have abnormal loops 
near the active site to facilitate metal substitution with catalytically active 
cadmium”, a rare example of enzymes that have evolved to use this 
normally solely toxic metal. 

The exploitation of metals varies between species, and trends exist in the 
superkingdoms’. The slope of plots comparing the number of zinc-bind- 
ing domains with the total number of protein domains encoded by each 
genome is greater in eukaryotes than in archaea or bacteria; the reverse 
trend is true for iron-, manganese- and cobalt-binding domains’. An 
implication is that zinc was increasingly recruited as eukaryotic genomes 
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became more complex, and this is especially true for multicellular eukary- 
otes, with a point of inflection detected at the intersection between unicel- 
lular and multicellular species’. This reflects the eukaryotic diversification 
of structural zinc-binding domains, notably zinc-finger and RING-finger 
domains, which constitute ~3% and ~1% of the human proteome, respec- 
tively“. Zinc-fingers are prevalent in the control of gene expression asso- 
ciated with differentiation into multiple cell types. Archaea and bacteria 
have, as a proportion, more iron-sulphur proteins (see page 831) but fewer 
haem proteins than eukaryotes, and within the bacteria aerobic species 
have fewer iron-sulphur cluster proteins and more haem proteins than 
anaerobic bacteria’. 


Metal mis-population 

It might be predicted that a consequence of proteins having tight affini- 
ties for incorrect metals is a potential for metals to mislocate, yet few 
such examples are documented. This rarity may reflect an exceptional 
efficiency of metal homeostasis or perhaps the experimental effort nec- 
essary to establish which metals are bound to proteins inside cells. Bind- 
ing sites of proteins involved in metal homeostasis typically allow metal 
exchange, but other sites can be buried, with the metal being kinetically 
trapped and safe from replacement with an incorrect metal. Even so, the 
correct metal must somehow become trapped in the first place. 

A subset of Escherichia coli manganese superoxide dismutase, 
MnSOD, is known in vivo to acquire iron”, which is catalytically inac- 
tive. Recombinant MnSODs from other organisms, expressed in E. coli, 
mis-populate with iron, cobalt or nickel”. Eukaryotic MnSOD, in the 
mitochondrial matrix, acquires catalytically inactive iron when mito- 
chondrial manganese and/or iron homeostasis is perturbed”. Cobalt 
mimics hypoxia in humans possibly because cobalt replaces iron in 
prolyl 4-hydroxylase, which in iron-form hydroxylates the hypoxia- 
responsive transcription factor HIF1 (also known as HIF1A)**. The 
wrong metals can also mislocate to metal prosthetic groups”. A lack of 
specificity in ferrochelatase, the enzyme that inserts iron in the synthesis 
of haem, can give rise to metal mis-incorporation in vivo such that high 
concentrations of zinc protoporphyrin in the blood are used in the clini- 
cal diagnosis of iron deficiency”. 


Metallochaperones 

One way to ensure that the correct metal is acquired by a metalloenzyme 
is to exploit delivery proteins, that is, metallochaperones. For example, 
nickel is inserted into bacterial hydrogenase (see page 814) and urease 
by dedicated nickel metallochaperones. It is unclear where metallo- 
chaperones acquire metal. No single donor for either of two yeast cop- 
per metallochaperones emerged from a systematic analysis of mutants 
lacking transporters”. Metal transfer has nonetheless been observed in 
vitro between cytosolic regions of the copper importer Ctr1 and a cop- 
per chaperone”, and a recent electron crystallographic structure of Ctr1 
might be exploited to visualize how this could occur’. Metal is passed 
from metallochaperones to cognate apoproteins by means of ligand- 
exchange reactions’. Thus, the specificity of protein contact, and of 
the subsequent ligand-exchange reactions, determines which proteins 
gain access to those metals supplied by metallochaperones. However, 
rather few metallochaperones are known, so most proteins are assumed 
to obtain metal from exchangeable cellular pools, and the contributions 
of metal sensors thus become vital. 


A bacterial model for metal sensing 

Bacteria have sets of metal- and DNA-binding, metal-sensing tran- 
scriptional regulators classified into families of metal de-repressors 
(ArsR-SmtB, CsoR-RenR and CopY), metal co-repressors (Fur, NikR 
and DtxR) and metal activators (MerR)****. A few sensors also detect 
metals external to the plasma membrane. The affinities of cytosolic sen- 
sors for the metals they detect have been used to make inferences about 
the concentrations of metals available to proteins”*. On the assumption 
that the sensors undergo metal exchange with cytosolic metal pools, 
their affinities become the thresholds for homeostasis. In switching 
from apo to holo form, metal sensors alter production of proteins to 
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acquire, expel or sequester metal, so the concentrations at which the 
sensors bind metal may not be exceeded. The affinities for metal induc- 
ers increase as the sensors detect metals farther up the Irving— Williams 
series’, implying that the most competitive metals are bound and buff- 
ered to exceedingly low concentrations”*. For example, two zinc sensors 
of E. coli, Zur and ZntR, are estimated to have femtomolar zinc affini- 
ties’, which is analogous to there being fewer than one metal ion per 
E. coli cell volume (formally nanomolar). 

In addition to metal affinity, the following two other factors dictate 
which metals a sensor detects in vivo. 


Access 

Metal availability in the cytosol is not standardized across species 
The metals detected by a sensor can change when tested in a different 
bacterium, perhaps chosen because it was more tractable for experimen- 
tation**”’. Kinetic factors, perhaps mediated by protein interactions, also 
affect access and differ between species. 


25,36,37 


Allostery 

A manganese sensor from Bacillus subtilis®, a nickel sensor from E. coli” 
or Helicobacter pylori and a nickel/cobalt sensor from Mycobacterium 
tuberculosis” all bind metals with orders of affinity that match the Irving— 
Williams series. However, in these sensors only the correct metals bind in 
an optimal conformation to alter DNA binding allosterically. 


Metal sensors in yeast 

Studies of metal-regulated gene expression have focused on a set of 
DNA-binding transcription factors in Saccharomyces cerevisiae (Fig. 2a). 
Functionally equivalent factors occur in other yeasts such as Schizosac- 
charomyces pombe, although the complement and mechanisms vary". 
Metal-responsive signal transduction pathways emanating from metal 
sensing at the plasma membrane are either absent or yet to be dis- 
covered in yeast. Metalloregulation of these yeast DNA-binding tran- 
scription factors can involve metal-dependent change in nuclear versus 
cytosolic localization, metal-dependent repression of activation-domain 
function and metal-mediated change in DNA binding™. In S. cerevisiae, 
Zap1 responds to zinc, Mac1 plus Ace1 (also known as Cup2) to copper, 
and Aft! plus Aft2 to iron (Fig. 2a). 

The metal-responsive transcription factors regulate expression of a 
repertoire of transporters that import or compartmentalize metals” 
(Fig. 2b). Mac! has also been implicated in a pathway of metal-responsive 
transporter degradation” . Yeast cells store excess metal in vacuoles”, 
with plasma-membrane exporters known only for cadmium and arsenic, 
and some yeasts have metal-sequestering metallothioneins. The factors 
also control metabolic switching; for example, the transcription factors 
Aft] and Aft2 indirectly (through Cth1 and Cth2 (also known as Tis11)) 
reduce the stability of transcripts encoding iron-dependent enzymes 
when iron is in poor supply**””. The iron regulon is also induced at the 
diauxic shift to support the enhanced iron requirements of respiration”. 
The extent to which the responses are triggered by metal binding to the 
S. cerevisiae transcription factors themselves has been explored. 
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Metal sensing by metal binding 

Macl and Acel activate transcription in response to low and, respec- 
tively, high concentrations of copper in S. cerevisiae*”’. Cysteine-rich 
motifs in both Macl and Acel can bind copper in vitro to form poly- 
copper clusters that are bridged by thiolates”’. Structures of the clusters 
have been modelled from extended X-ray absorption fine structure 
(EXAFS) data” (Fig. 2c). As copper concentrations increase, there is a 
reciprocal relationship between the loss of activation of Macl-regulated 
genes and the induction of Acel-regulated genes”. A simple model 
is that at some critical intracellular copper concentration, clusters 
form in both proteins to repress Macl function while activating Acel. 
DNA binding by Acel is enhanced by copper in vitro”. It is not yet 
known whether Macl and Acel compete for the same pool of cop- 
per or whether copper is delivered by a metallochaperone. Polycopper 
clusters are structurally distinct and so have the potential to provide 
allosteric metal specificity in favour of copper through the unique pro- 
tein conformations they induce. 

Activation (or repression on some promoters) of gene expression in 
response to zinc is controlled by the Zap] transcription factor, a process 
that depends on its zinc-fingers. Zap1 has two transcriptional activation 
domains, one of which is located within an unusual pair of zinc-fingers™. 
The zinc in these atypical fingers is more labile than in the standard 
DNA-binding zinc-fingers”. Zap1 metal regulation by means of vari- 
able zinc occupancy of these atypical fingers is strongly supported by 
in vivo observations of fluorescence resonance energy transfer (FRET) 
between probes on adjacent fingers™ (Fig. 2d). The fluorophores only 
become close enough to allow energy transfer when the holo-fingers 
form, and this occurs only when zinc concentrations increase. Thus, for 
elevated zinc concentrations these atypical zinc-fingers become occu- 
pied by metal, masking the activation domain and thereby inactivat- 
ing gene expression. The zinc affinities of the labile Zap1 zinc-fingers, 
expressed without the rest of Zap1, are in the nanomolar range”. This 
might imply that intracellular available zinc accumulates to a higher 
concentration in yeast than in bacteria, but perhaps the zinc affinities 
are tighter in intact Zap1. There is cooperative binding of zinc to the 
two atypical fingers of Zap] (ref. 55), offering the potential to confer 
allosteric metal specificity. 


Indirect responses to metals 

Iron sensing in yeast is mediated by Aftl (ref. 56) (Fig. 2a) or its par- 
alogue, Aft2 (ref. 57). The more extensively studied Aft] undergoes 
iron-responsive nucleocytoplasmic shuttling ©. When the iron con- 
centration is low, Aft] accumulates in the nucleus. Nuclear Aft1 activates 
transcription of a regulon that includes genes encoding iron transport- 
ers” (Fig. 2b). Details of the iron-switch are being pieced together. The 
switch requires cysteine residues in either Aftl or Aft2 (refs 56, 57), two 
monothiol glutaredoxins (Grx3 and Grx4)” and two cytosolic proteins 
(Fral and Fra2)”. In vitro, Grx3 and Grx4 homodimers associate with 
a [2Fe-2S] cluster® and Aftl is known to be responsive to the degree 
of iron-sulphur-cluster biosynthesis in the mitochondria™. Aftl might 
respond to iron indirectly, with the proteins of iron—sulphur-cluster 


1004 Figure 1| Metals used in 
catalysis. a, The elements used as 
751 cofactors by enzymes are shown in 
504 blue. The height of each column 
represents the proportion of all 
257 enzymes with known structures using 
Ae all pone the respective metal. A single enzyme 
Mg Ca = Mn Fe uses cadmium”. b, The proportion 
1005 of proteins using the indicated metals 
751 that occur in each of the six enzyme 
classes: oxidoreductases (EC 1), blue; 
507 transferases (EC 2), yellow; hydrolases 
(EC 3), purple; lyases (EC 4), pink; 
254 iF isomerases (EC 5), green; ligases 
ol, ; ; toa  (EC6), grey. EC, Enzyme Commission. 
Co Cu Zn Mo+W _ Data obtained from ref. 13. 
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biosynthesis being the incipient discriminatory sensors. 

Classical signal transduction pathways may also be involved in copper 
sensing, because the DNA-binding form of Mac] is phosphorylated™. 
Furthermore, as hinted previously, no copper chaperone has been linked 
to copper delivery to the nucleus. In a search for such a donor, Mac1- 
responsive gene expression was systematically tested in mutants lacking 
each of the known copper metallochaperones™. A dependence on the 
copper donor for superoxide dismutase (Sod1) was discovered; however, 
it transpires that this is only indirect, because Macl needs catalytically 
active Sod] in the nucleus to function®. The mere suppression of oxida- 
tive stress does not overcome the need of Mac] for nuclear Sod1, sug- 
gesting a key role of redox changes in the nucleus for the copper response 
of Mac] (ref. 65) (Fig. 2e). The activity of Sod1, itself an abundant copper 
enzyme, may be the indicator used to monitor copper sufficiency, raising 
many questions about where metal discrimination takes place. 
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Figure 2 | Metal sensing in Saccharomyces cerevisiae. a, Each metal- 
responsive transcription factor has a DNA-binding domain and an 
activation domain (shown as two blocks). Zap1 (orange) has multiple sets 
of zinc-fingers (subscripts indicate the number of zinc atoms in each set). 
When zinc binds to zinc-fingers 1 and 2 (Zfl1 and Zf2) in the activation 
domain (bound zinc indicated by Zn,), activation is repressed. Copper- 
cluster formation (Cu,) in Ace1 (blue-green) encourages DNA binding. By 
contrast, in Macl (blue), copper clusters inhibit activation-domain function 
and weaken DNA binding. Under high-iron conditions, Aft1 (pink) 
becomes extranuclear. These factors generally activate transcription when 
bound to DNA (+), but Zap1 can also act as a repressor (-). The mRNA- 
binding proteins Cth1 and Cth2 (pink) downregulate transcripts encoding 
iron-requiring proteins, and the genes encoding Cth1 and Cth2 are 
controlled by Aft1 and Aft2. b, The metal-responsive transcription factors 
control the expression of genes encoding a subset of metal transporters 
(tubes), which allow the passage of metals through various membranes in 
S. cerevisiae (pink, iron transporters; orange, zinc transporters; and blue, 
copper transporters). The colour of the transporter also corresponds to 
the specific transcription factor involved in regulation: Aft1 and/or Aft2 
(pink), Zap1 (orange), and/or Macl (blue), with several exceptions (Fet4 is 


826 


Acel cy (inactive) 


Metal sensing in animals 

The cells of multicellular eukaryotes can respond to the metal status of 
the whole organism as well as to their own status, and systemic signal- 
ling molecules have been discovered such as hepcidin in animals. Many 
responses to metals in plants and animals occur post-transcriptionally, by 
adjusting messenger-RNA stability or translation, or post-translationally, 
by adjusting intracellular protein trafficking or degradation. With a few 
significant exceptions, the metal-binding, metal-sensing sites that trigger 
these events remain to be discovered. Plant metal homeostasis is reviewed 
elsewhere’®®, but the discovery of Spl7 (Crr in Chlamydomonas), which 
possibly binds (and senses) copper, should be noted. 


Metal regulation of transcription 
Just one metal-binding, metal-sensing transcriptional regulator 
is currently known in animals, the metal-regulatory transcription 
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regulated by Zap1 and Aft1; Cot1 is regulated by Aft1; Zrg17 is regulated 
by Zap1 but Msc2 is not; and Ccc2 is regulated by Aft1). Arrows show 

the direction of metal transport. Fet3 and Fet5 are ferroxidases. Fet4 is 

a different shade of pink because it can transport iron, zinc and copper. 

c, Copper clusters of Macl (or Ace1, not shown) may allow allosteric metal 
specificity. Model of the metal site based on X-ray absorption fine structure 
(EXAFS) data. Spheres represent carbon atoms unless labelled otherwise. 
(Panel reproduced, with permission, from ref. 51.) d, The binding of zinc 
to the zinc-fingers in Zap1 (Zf1 shown in red, and Zf2 in cyan) has been 
followed in vivo using FRET: the energy transfer between enhanced cyan 
fluorescent protein and enhanced yellow fluorescent protein attached to 
Zf2 and Zf1 was measured, providing evidence that zinc occupancy of these 
fingers is modulated in vivo. Zigzag arrows indicate the light (wavelength 
shown) either absorbed or emitted by the fluorescent proteins. (Panel 
modified, with permission, from ref. 54.) e, Redox chemistry is involved 

in regulating the activity of Macl. Macl function depends on the presence 
of active Sod1 in the nucleus, which depends on copper being delivered to 
Sod1 by the metallochaperone Ccs1. The consumption or generation of a 
reactive oxygen species (ROS) by Sod1 might modify regulatory cysteine 
residues on Mac] (ref. 65). 
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factor MTF1, although other DNA-binding zinc-finger proteins, such 
as SP1, do act downstream of unknown metal sensors”. Zinc stabilizes 
a complex between MTF1 and its DNA-binding site, with at least some 
of the sensing mechanism residing in its DNA-binding Cys,His, zinc- 
fingers’°. MTF1 is commonly, although not exclusively, a transcrip- 
tional activator®, acting with other factors”. In the special case of the 
Drosophila MTF1 homologue, some of these interactions are linked to 
metal specificity”’. 

Transcription of mammalian MTF] target genes is induced in response 
to metals such as copper and cadmium, not merely zinc’”””’. Oddly, in 
a cell-free system copper or cadmium have the opposite effect, inhibit- 
ing mammalian MTF1-dependent transcription, but this is reversed 
simply by adding one further component, zinc metallothionein, to the 
reaction mixture”. Addition of apothionein, by contrast, is inhibitory”. 
An explanation is that cadmium and copper displace zinc from binding 
sites such as those of zinc metallothionein; the liberated zinc then binds 
to the zinc-fingers of mammalian MTF1 (Fig. 3a). Here, metal specificity 
is achieved through a combination of metal access and allostery. 

The Drosophila protein MTF1 is distinct in having a copper cluster 
and zinc-fingers”. Flies in which the cysteine residues that coordinate 
the cluster are converted to alanine fail to upregulate transcription of 
a copper-metallothionein gene in response to a high copper concen- 
tration. Hypothetical co-regulators are proposed to interact with Dro- 
sophila MTF1 dependent on cluster formation. Alternative co-regulators 
are thought to enable Drosophila MTF1 to upregulate some genes in 
response to copper excess while upregulating others, such as that encod- 
ing the CTR1B copper importer, in response to copper limitation. Zinc 
can bind to the cysteine thiolate ligands of the copper cluster in vitro 
without impairing subsequent copper binding”. As for yeast copper- 
responsive transcription factors, structurally distinct polycopper clusters 
have the potential to confer allosteric metal specificity by means of the 
unique protein conformations they induce. 


Metal regulation of mRNA stability and translation 

Ina highly parsimonious way, the iron-responsive proteins IRP1 (also 
known as ACO1) and IRP2 (IREB2) control translation or mRNA sta- 
bility reciprocally for different gene products”. A cleft in IRP1 and, by 
inference, IRP2 binds to iron-responsive elements (IREs) that form stem- 
loops in mRNAs” (Fig. 3b). IREs occur in the 5’ untranslated region of 
mRNA encoding the ferritin iron store and in the 3’ untranslated region 
of the mRNA encoding the transferrin receptor for iron uptake. Ferritin 
is needed when iron is in surplus, and transferrin receptor is needed 
when iron is lacking. IRP binding to the 5’ region blocks translation 
whereas binding to the 3’ region stabilizes the mRNA. An iron-sulphur 
cluster binds within, and hence blocks, the cleft of holo-IRP1. Thus, 
holo-IRP1 dissociates from mRNAs to allow translation of ferritin but 
degradation of transferrin-receptor mRNA when iron-sulphur clusters 
are sufficiently abundant to fill its cleft''. As for polycopper clusters, 
binding of iron-sulphur clusters to a sensor may confer allosteric metal 
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specificity, but because iron-sulphur clusters are themselves products 
of a biochemical pathway, perhaps — as noted for yeast Aft! — metal 
discrimination really takes place in the cluster-assembly proteins. 


Post-translational metal regulation 

Metal-responsive degradation of metal-homeostasis proteins is a general 
strategy that applies, for example, to IRP2 (ref. 76), to the copper chaper- 
one CCS”, to the copper importer CTR] (also known as SLC31A1)’*, to 
the iron-efflux protein ferroportin” and to the zinc-uptake transporter 
Z1P4(SLC39A4)"° (Fig. 4). ZIP4 has three levels of post-translational 
regulation in response to zinc. The extracellular amino-terminal half 
of ZIP4 is proteolytically processed during prolonged zinc deficiency”, 
ZIP4 is removed from the plasma membrane to a perinuclear location 
when zinc becomes replete” and ZIP4 is degraded when zinc is highly 
elevated’. The third process involves perception of zinc by a cytoplasmic 
histidine-rich motif that somehow exposes a site for ubiquitination and, 
hence, proteasomal degradation. The same histidine-rich motif triggers 
ZIP4 degradation in response to cadmium, but degradation does not 
occur in response to iron, manganese, nickel, copper or cobalt*®. The 
question of which elements the motif distinguishes through affinity, 
allostery or access awaits exploration. 

As is the case for ZIP4 during the transition between low-zinc and 
zinc-replete conditions, other metal transporters also undergo metal- 
dependent trafficking (not solely for degradation). This was first 
observed for copper-transporting ATPase ATP7A, which redistributed 
from internal trans-Golgi membranes to the plasma membrane in cell 
cultures exposed to high copper concentrations® (Fig. 4). Under low- 
copper conditions ATP7A supplies metal to copper proteins in the trans- 
Golgi network, whereas under high-copper conditions it aids efflux. 
An analogous protein (ATP7B) is expressed in the liver, where it relo- 
cates to apical hepatocyte membranes under high-copper conditions to 
cause efflux of the surplus into bile caniculi™ (Fig. 4). Several mutations 
in ATP7A or ATP7B cause the transporters to remain within internal 
membranes, and increased trafficking to the plasma membrane rather 
than reduced re-internalization by endocytosis is triggered by high cop- 
per concentrations’. A nine-residue stretch of ATP7B is speculated to 
bind a partner in response to copper™. In this model, the copper-sensing 
sites are responsible for exposing the nine-residue signal but the sensor 
remains a mystery. One suggestion is that the acyl phosphate intermedi- 
ate state of the ATPase itself exposes the nine-residue motif™. Trafficking 
of CTR1 in response to copper is also linked to its transport activity, and 
toa methionine-rich motif”. Here, the levels of catalytic activity of CTR1 
and ATP7B become coupled to trafficking, and selectivity of sensing 
becomes a function of the selectivity of transport. 


Systemic signals of metal status 

Hepcidin is a disulphide-rich peptide consisting of 25 amino acids that 
is synthesized and released by the liver in response to increased iron”. 
Hepcidin binds to ferroportin to induce internalization and lysosomal 


Figure 3 | Metal sensors in animals. a, MTF1 
responds to cadmium in vivo (but not in 
vitro) indirectly, as a function of access to 
zinc. Cadmium binds to metallothionein 
(MT) more tightly than does zinc. Zinc 
released from metallothionein is taken up 
by zinc-fingers of MTF1, which in turn 
interacts with metal-response elements 

on DNA to effect the transcription of 

target genes through adaptor proteins”. 

b, Iron-responsive protein IRP1 (a-helices, 
red; B-strands, green) in complex with an 
iron-regulatory element (IRE) from the 

5’ untranslated region of ferritin mRNA 
(purple, in stick format) (Protein Data Bank 
code 2IPY)''. Binding of IRPs to this mRNA 
region inhibits the translation of the iron- 
storage protein ferritin. 
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degradation”. Ferroportin is otherwise responsible for iron export into 
the circulation across the basolateral membrane of enterocytes” (Fig. 4). 
Thus, hepcidin inhibits dietary iron absorption. The mechanism by which 
hepcidin synthesis and release respond to iron is independent of IRPs 
because there are no IREs in the hepcidin mRNA. The iron signal is trig- 
gered by competition between transferrin receptors one and two for the 
hereditary haemochromatosis protein HFE, which is influenced by occu- 
pancy of receptor one with holo-transferrin””. Metal discrimination could 
be said to take place in the formation of diferric transferrin, in which the 
valence state of iron plays a major part in selectivity. 


Outlook 

Bias in favour of the correct metal can occur during folding of a metallo- 
enzyme if coupled to favourable energetics*® (Box 1). Perhaps some 
metal-coordination spheres have evolved to leave patches exposed if an 
incorrect metal binds, to attract folding chaperones. Such a mechanism 
might check that metalloproteins have enfolded the correct metals in 
vivo, a form of proofreading of metal acquisition. To understand metal 
occupancy, methods used decades ago to fractionate metalloproteins 
should be revisited, but equipped today with improved multiple-metal 
detection by inductively coupled plasma mass spectrometry and inor- 
dinately improved protein identification by mass fingerprinting. Math- 
ematical methods such as principal-component analysis circumvent 
the need to purify holoproteins to homogeneity before identification”. 
Metallochaperones can influence the intracellular distribution of metals 
but other kinetic factors, such as interactions with metal importers (or 
exporters from storage compartments) or proximity to sites of metal- 
loprotein degradation, may also create metal niches. 

Metal sensors influence metal availability in cells, which in turn influ- 
ences metal occupancy of metalloproteins. The contribution of allostery 
to metal specificity in bacterial metal sensors is predicted to be especially 
important for metal co-repressors and metal activators, in which metal 
binding to the protein must organize an active protein-DNA adduct. 
Another prediction is that extra ligands may have been selected adjacent 
to some sensory metal sites to lure the wrong metals into non-productive 
binding configurations. A further untested prediction is that access to 
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Figure 4 | Metal-responsive trafficking and processing of metal transporters. 
Mammalian metal transporters (coloured tubes) themselves move within 
the cell in response to metal ions. Depending on whether the metal 
concentrations are high ( | ), low ( J ) or optimal (no arrow) for a particular 
transporter, the transporter can be internalized and transported within 

the cell (curved coloured arrows) and then degraded or cleaved. Most of 

the sites at which metals bind to, and are sensed by, transporters, thereby 
effecting these changes, remain to be discovered. Within an organism, the 
distribution of metal transporters can be widespread (as is the case for CTR1 
and ATP7A) or restricted to particular cell types (all other transporters). In 
addition, in polarized cells, transporters can be expressed on the apical or 
basolateral side. For example, in response to high concentrations of iron, 
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metal is partly a function of the relative affinities of a set of metal sen- 
sors’. For example, provided that a zinc sensor has the tightest affinity 
for zinc in the set, other sensors will never gain access to zinc even if 
they could otherwise bind zinc more tightly than their bona fide metal 
effectors. 

An important question in relation to metal sensing in S. cerevisiae is 
that of what redox chemistry is associated with Aftl and Macl. X-ray 
crystal structures, plus NMR solution studies, of bacterial DNA-bind- 
ing metal sensors in apo form, in holo forms with different metals, as 
multimers and as DNA adducts, have helped visualize metal sensing. 
Equivalent insight is largely missing from eukaryotic metal sensing, 
with one exception being IRP1 in complex with stem-loops ofa ferritin 
transcript"! (Fig. 3b). Recombinant metal-binding domains have been 
examined in isolation, including the zinc-fingers of Zap1 (ref. 55), those 
of Acel homologue Amt] (also known as Mep1)* and of MTF (ref. 90), 
and the copper clusters of Acel (ref. 51), Macl (ref. 51) and Drosophila 
MTFI (ref. 74). Technical advances in determining larger solution and 
crystal structures hold promise for visualizing the apo and holo forms of 
intact eukaryotic metal-sensing transcriptional regulators, particularly 
in complex with co-regulators and with DNA. 

In animals, there are plenty of metal-mediated changes in the abun- 
dance of transcripts that neither contain IRP-binding sites nor are 
encoded by genes with promoters containing the MTF1-recognition 
element. For instance, ZNT5 (also known as SLC30A5) transcripts are 
subject to both zinc-responsive synthesis and stability through unknown 
factors”. Repositories of transcriptomic data catalogue a multitude of 
responses to altered levels of different metals in mammalian tissues or 
cells (see, for example, ArrayExpress (European Bioinformatics Insti- 
tute; http://www.ebi.ac.uk/microarray-as/ae/) and the Gene Expression 
Omnibus (US National Center for Biotechnology Information; http:// 
www.ncbi.nlm.nih.gov/geo/)). To discover unknown DNA- or RNA- 
binding factors, it should be possible to interrogate the sequences of 
metal-responsive genes to identify common nucleotide elements in those 
with similar responses to a metal, thereby identifying bait sequences with 
which to search for cognate metal regulators. Some iron-responsive genes 
contain the nucleotide-binding site for HIF1 (ref. 92), and one unproven 
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the liver produces hepcidin (left), which binds to ferroportin (FPN) in the 
basolateral membranes of intestinal cells known as enterocytes, triggering 
the internalization and degradation of FPN. The amount of iron in the liver 
can thus influence how much iron is transported from the intestine into the 
circulation. Known distributions of metal transporters are as follows: FPN, 
intestine (basolateral), placental trophoblast (basolateral), and monocytes 
and macrophages; ZIP4, pancreatic B-cells, intestine (apical), embryonic 
visceral yolk sac (apical, in mouse) and kidney; ZIP5, pancreatic acinar cells 
(basolateral), embryonic visceral yolk sac (basolateral, in mouse), kidney, 
spleen, liver and intestine (basolateral); ATP7A, polarized cells (basolateral); 
ATP7B, liver (apical), mammary gland (apical), kidney (apical) and placental 
trophoblast (apical). 


© 2009 Macmillan Publishers Limited. All rights reserved 


hypothesis is that HIF1 activity responds to iron through the iron 
dependence ofits prolyl 4-hydroxylase**”’. Extracellular zinc also seems 
to trigger signal pathways through a G-protein-coupled mechanism™, 
suggesting the presence of unknown plasma-membrane metal recep- 
tors. Metazoans also somehow modify the set points for metal sensing 
to sustain different metal concentrations in different tissues, and all cells 
somehow prioritize the expression of different metalloproteins under 
conditions of metal deficiency. 

With so many metabolic processes requiring metals, dyshomeostasis 
is expected to feature widely in pathologies. Aberrant copper transport is 
the cause of Menkes disease (mutations in ATP7A) and Wilson’s disease 
(mutations in ATP7B)®, aberrant zinc transport is a cause of acroderma- 
titis enteropathica (mutations in ZIP4)", and aberrant iron transport isa 
cause of haemochromatosis (mutations in ferroportin, HFE, TFR2, hep- 
cidin or haemojuvelin)*, to name but a minuscule subset of examples. 
Type 2 diabetes mellitus results from interaction between environmental 
and genetic factors, and a linked polymorphism is found in a zinc trans- 
porter that supplies insulin granules in B-cells”’. Perhaps progressive 
accumulations of competitive and/or redox-active metals in the wrong 
locations underlie some links between metals and multiple neurological 
disorders”’. A more complete knowledge of metal homeostasis and, by 
implication, metal sensing is likely to precede an understanding of its 
aberrations. A knowledge of how sensors discriminate between metals 
has the potential to aid the development of therapies. 

Changing metal availabilities over geological time fashioned the metal- 
protein partnerships. As dissolved CO, acidifies the modern oceans”, 
metal availabilities may be changing yet again. The action of carbonic 
anhydrase in providing substrate for oceanic photosynthetic carbon 
fixation by phytoplankton is central to primary productivity and to the 
carbon cycle through its removal of acidifying bicarbonate. It is unclear 
how organisms with carbonic anhydrases of differing metal requirements 
will respond. There is pressing interest in coupling photosynthesis to 
the production of biofuels such as gaseous hydrogen. Hydrogenase will 
have to be supplied with nickel by means of its metallochaperone, but 
the metallochaperone might be unable to acquire enough metal in a het- 
erologous photosynthetic bacterium. We already know that nickel is so 
poorly available in a cyanobacterial cytosol that a mycobacterial nickel/ 
cobalt sensor fails to sense nickel”®. To populate metalloproteins with the 
correct metals in heterologous cells, special strains should be made in 
which bioavailable metal is adjusted, for instance by changing the affini- 
ties of metal sensors to alter the threshold concentrations within which a 
desired metal is buffered. A robust understanding of how metal sensors 
discriminate between the elements is a prerequisite to understanding 
and, hence, to engineering optimal ‘metallation. a 
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INSIGHT 


Function and biogenesis of 


iron-sulphur proteins 


Roland Lill’ 


lron-sulphur (Fe-S) clusters have long been recognized as essential and versatile cofactors of proteins 
involved in catalysis, electron transport and sensing of ambient conditions. Despite the relative simplicity 

of Fe-S clusters in terms of structure and composition, their synthesis and assembly into apoproteins is a 
highly complex and coordinated process in living cells. Different biogenesis machineries in both bacteria 

and eukaryotes have been discovered that assist Fe-S-protein maturation according to uniform biosynthetic 
principles. The importance of Fe-S proteins for life is documented by an increasing number of diseases linked 


to these components and their biogenesis. 


Many proteins depend on cofactors for their function. Typically, these 
low-molecular-mass compounds bind to well-defined and evolu- 
tionarily conserved sequence motifs either covalently or non-cova- 
lently. Organic cofactors include various nucleotides (such as flavin 
mononucleotide and flavin adenine dinucleotide), vitamins (biotin, 
pantothenate and folate) and metal-organic compounds (haem and 
molybdenum cofactor). Common inorganic cofactors include various 
metal ions (Mg”*, Zn*, Mn**, Cu'*””*, Fe**”** and so on), which usu- 
ally bind to mononuclear binding sites. The oldest and most versatile 
inorganic cofactors are probably Fe-S clusters, which can participate 
in electron transfer, catalysis and regulatory processes’. The chemi- 
cally simplest Fe-S clusters are the rhombic [2Fe-2S] and the cubane 
[4Fe—4S] types, which contain iron (Fe**’**) and sulphide (S”). Fe-S 
clusters are usually integrated into proteins through coordination of 
the iron ions by cysteine or histidine residues, yet alternative ligands 
(Asp, Arg, Ser, CO, CN’ and so on) are known, particularly in more 
complex Fe-S clusters. Several different structural folds have been rec- 
ognized to coordinate these simple Fe-S clusters”. Nevertheless, it has 
remained difficult to predict the presence of Fe-S clusters from protein 
sequences. More complex Fe-S clusters, often containing other metal 
ions, are present in, for example, nitrogenase or hydrogenase enzymes, 
but are not discussed here (see pages 814 and 839). 

Fe-S clusters were discovered in the early 1960s by purifying enzymes 
with characteristic electron paramagnetic resonance signals. Some of the 
first Fe-S proteins to be discovered include plant and bacterial ferredox- 
ins and respiratory complexes I-III of bacteria and mitochondria. In the 
late 1960s, chemists and biochemists devised chemical reconstitution 
protocols to assemble Fe-S clusters into apoproteins in vitro, leading to 
the view that these cofactors can assemble spontaneously on proteins’. 
However, genetic, biochemical and cell-biological studies in the 1990s 
provided ample evidence that the maturation of Fe-S proteins in living 
cells is a catalysed process rather than a spontaneous one. In striking 
contrast to the chemical simplicity of Fe-S clusters, their biosynthesis in 
vivo appears to be a rather complex and coordinated reaction* "’. In the 
past decade, numerous biogenesis components were identified, and the 
first insights into the mechanisms of biogenesis were obtained. In this 
Review, I summarize our current knowledge on the biogenesis of Fe-S 
proteins in bacteria and eukaryotes, and emphasize the importance of 
Fe-S proteins for essential physiological pathways in living cells. First, I 


briefly address the properties of various Fe-S clusters and the biological 
functions of Fe-S proteins. 


Functions of Fe-S proteins depend on cofactor properties 

The most common function of Fe-S clusters, electron transfer, is based 
on the propensity of Fe to formally switch between oxidative states +2 
and +3 (ref. 1). Within a given proteinaceous surrounding, Fe-S clus- 
ters can adopt redox potentials from —500 mV to +300 mV (ref. 2). 
Thus, Fe-S clusters can serve as excellent donors and acceptors of elec- 
trons in a variety of biological reactions. Examples are bacterial and 
mitochondrial respiratory complexes I-III, photosystem I, ferredoxins 
and hydrogenases. Another well-studied function of Fe-S clusters is in 
enzyme catalysis, a classic example being aconitase, in which a non- 
protein-coordinated Fe at one edge of a [4Fe-4S] cluster serves as a 
Lewis acid to assist H,O abstraction from citrate (the substrate), which 
is converted to isocitrate. Other special cases are the radical S-adenosyl- 
L-methionine (SAM) enzymes biotin synthase and lipoate synthase, 
which bind two Fe-S clusters each. It is believed that one of these clus- 
ters is disassembled during the formation of the products biotin and 
lipoic acid, respectively, thus serving as a sulphur donor’. Numerous 
other catalytic functions are known for bacterial and eukaryotic Fe-S 
enzymes involved in metabolism’. However, in many cases the precise 
role of the Fe-S cluster is still unclear, and it is therefore possible that in 
some proteins the Fe-S cluster simply plays a structural role. This may 
be true for the recently discovered Fe-S clusters in adenosine triphos- 
phate (ATP)-dependent DNA helicases involved in nucleotide excision 
repair (Rad3, XPD (also known as ERCC2), FANCJ (BRIP1) and so 
on)”°. Likewise, the precise role of the two [4Fe-4S] clusters in the ABC 
protein Rlil (ABCE1 in humans) involved in ribosome biogenesis and 
function is unknown, despite the availability of a crystal structure of 
the holoprotein’®*”. 

A third general role of Fe-S clusters is in sensing environmental or 
intracellular conditions to regulate gene expression (see page 823). Exam- 
ples are the bacterial transcription factors FNR, IscR and SoxR, which 
sense O,, Fe-S clusters and superoxide/NO, respectively. The switch 
between the activating and repressed states depends on the presence or 
absence of an Fe-S cluster (FNR or IscR) or the redox state of the [2Fe-2S] 
cluster present in SoxR™. A classical example for post-transcriptional reg- 
ulation of gene expression by Fe-S clusters is provided by the mammalian 
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cytosolic iron regulatory protein 1 (IRP1, also known as ACO1). Under 
iron-replete conditions, IRP1 holds a [4Fe-4S] cluster and functions as an 
aconitase. When the protein loses its labile cluster under iron deprivation, 
it can bind to stem-loop structures (termed iron-responsive elements 
(IREs)) in certain messenger RNAs of proteins involved in iron uptake, 
storage and distribution in the cell”. Binding of apo-IRP1 to 5’-located 
IREs blocks translation by inhibiting ribosome scanning to the start AUG 
codon, whereas association with 3’-located IREs protects mRNAs from 
nucleolytic degradation, leading to increased translation. 


Dedicated machineries for maturation of Fe-S proteins 
The maturation of bacterial Fe-S proteins has been intensely studied 
in Escherichia coli and the azototrophic (nitrogen-fixing) Azotobacter 
vinelandii. Studies have identified three different systems for the bio- 
genesis of bacterial Fe-S proteins: the NIF system, for specific matura- 
tion of nitrogenase in azototrophic bacteria; and the ISC assembly and 
SUF systems, for the generation of housekeeping Fe-S proteins under 
normal and oxidative-stress conditions, respectively**’*”*. During evo- 
lution, the latter two machineries were transferred by endosymbiosis to 
eukaryotes containing Fe-S proteins in mitochondria, cytosol and the 
nucleus. Mitochondria have retained components homologous to those 
of the bacterial ISC assembly system*”, whereas plastids host proteins 
of the SUF machinery*"”. Assembly of Fe-S proteins in the eukaryotic 
cytosol and nuclei requires the assistance of both the mitochondrial 
ISC assembly machinery and a mitochondrial ISC export system”. 
Additionally, maturation of these Fe-S proteins is dependent on the 
essential cytosolic Fe-S-protein assembly (CIA) machinery, which is 
present in virtually all eukaryotes®*. The yeast Saccharomyces cerevi- 
siae has served as an excellent model organism with which to establish 
the first details of the complex biosynthesis pathways in eukaryotes. 
Recent investigations in human cell culture and other model systems 
have demonstrated that the entire process is highly conserved from 
yeast to human (see below). 

Despite the obvious differences among the various biogenesis systems 


Box 1| Biosynthetic principles of Fe-S-protein biogenesis 


Several systems for the biogenesis of Fe-S proteins have been identified 
in bacteria and in various cell compartments of eukaryotes, namely the 
bacterial NIF system, the bacterial and mitochondrial ISC assembly 
machineries, the bacterial and plastid SUF systems and the eukaryotic 
CIA machinery for cytosolic and nuclear Fe-S proteins. These different 
machineries may follow common biosynthetic rules. The overall 
biogenesis process can be split into two main steps: the de novo assembly 
f an Fe-S cluster on a scaffold protein, and the transfer of the Fe-S 
uster from the scaffold to target apoproteins and its subsequent 
ssembly into the polypeptide chain. Each of these steps involves the 
articipation of several proteins and cofactors, which execute specific 
iosynthetic partial reactions. These reactions may define general 
rinciples that underlie all known biosynthetic systems, but for some 
ystems the partial reactions remain to be identified. 

The bold numbers in the figure correspond to the following steps. 

1) Sulphur donor. A cysteine desulphurase (termed NifS, IscS or SufS in 
bacteria and Nfs1-Isd11 in mitochondria) releases the sulphur required 
for Fe-S-cluster formation from cysteine to produce alanine. As an 
intermediate, a persulphide is formed on a conserved cysteine residue 
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found in bacteria and in eukaryotes, general biosynthetic principles 
seem to underlie the in vivo synthesis of Fe-S clusters and their assembly 
into apoproteins (explained in detail in Box 1). In the following, I briefly 
summarize our current knowledge of the components of the various 
assembly systems and describe their respective mechanistic functions. 
The special assembly pathways of nitrogenases, used by the NIF system, 
and hydrogenases, used by dedicated maturation systems, will not be 
addressed here. The reader is referred to recent reviews™***. 


The ISC assembly systems in bacteria and mitochondria 

The experimental study of Fe-S-protein biogenesis was boosted by the 
identification of the bacterial isc operon”. This discovery not only aided 
work on bacterial Fe-S-protein assembly, but also influenced the first 
attempts to identify biogenesis proteins in eukaryotes. The evolutionary 
relationship between bacteria and mitochondria led to the identifica- 
tion and functional characterization of several mitochondrial proteins 
homologous to the bacterial ISC system”*”*’. The striking similarities 
between the bacterial and mitochondrial ISC components and the 
underlying assembly mechanisms justify a comparative discussion of 
these related systems (Table 1). 

As explained in Box 1, biosynthesis of Fe-S proteins can be separated 
into two main steps. In the ISC systems, an Fe-S cluster is initially and 
transiently assembled on the scaffold proteins IscU (bacteria) and Isul 
(mitochondria), which contain three conserved Fe-S-cluster-coordinat- 
ing cysteine residues”*! (Figs 1 and 2). Then the Fe-S cluster is trans- 
ferred from Isu1/IscU to recipient apoproteins for incorporation into the 
Fe-S apoprotein by coordination with specific amino-acid residues”. 
The first reaction, Fe-S-cluster assembly on Isu1/IscU, critically depends 
on the function ofa cysteine desulphurase as a sulphur donor (Box 1). In 
bacteria, this reaction is performed by IscS, which is highly similar to the 
founding member of this protein family, NifS, involved in nitrogenase 
maturation” (Fig. 1). The crystal structures of several desulphurases 
are known and show a dimeric two-domain protein, with one domain 
harbouring the pyridoxal-phosphate-binding site and a smaller domain 


of the enzyme. This persulphide may then be transferred to conserved 
cysteine residues of helper proteins (SufE) or directly to scaffold proteins 
(see below). 

(2) lron donor. Iron is unlikely to be free in solution. To guarantee 
its accurate delivery to scaffold proteins, specific iron donors may be 
needed. Such a donor function is performed by bacterial (CyaY) and 
mitochondrial (Yfh1) ISC components, which bind iron, the desulphurase 
and the scaffold protein Isu1/IscU. 

(3) Electron transfer. Electrons are needed for the reduction of S° 
(present in cysteine) to sulphide (S*, present in Fe-S clusters). An 
electron transfer function may be provided by ferredoxin reductase 
and ferredoxin of the ISC assembly machineries and by the central, 
ferredoxin-like domain of NifU in the NIF system. 

(4) Scaffold proteins. These proteins serve as a platform for the de novo 
biosynthesis of an Fe-S cluster. They contain conserved cysteine residues 
and bind an Fe-S cluster in a labile manner, meaning that this cluster can 
be transferred to target proteins and stably integrated. The most highly 
conserved scaffolds are bacterial IscU and SufU, and eukaryotic sul. 
Other scaffolds include bacterial NifU, plastid NFU proteins and bacterial 
IscA and SufA. 

(5) Cluster transfer proteins. Specific factors are involved in transferring 
the labile Fe-S clusters bound to the scaffolds to apoproteins, which 
are converted from the apo form to the holo form. And these transfer 
proteins may have three roles: to induce dissociation of the scaffold- 
bound Fe-S cluster; to guarantee its accurate and specific transfer to 
bona fide Fe-S apoproteins; and to promote correct assembly of the Fe-S 
cluster at the proper acceptor sites. In many cases, these cluster transfer 
proteins can be bypassed in experiments in vitro but are essential in living 
cells. Examples are the mitochondrial Ssq1 and Jacl and bacterial HscA 
and HscB chaperones of the ISC assembly systems, and Nar] and Cia of 
the CIA machinery. 
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Table 1 | Compilation of central components involved in Fe-S-protein biogenesis 


Function Bacterial NIF BacterialSUF Bacterial ISC Mitochondrial ISC Eukaryotic CIA 

Cysteine desulphurase, sulphur donor NifS SufS-SufE IscS Nfs1-Isd11 Mitochondrial 
Nfs1-Isd11 

U-type scaffold for Fe-S-cluster assembly NifU (amino-terminal domain) SufU IscU Isul = 

A-type scaffold for Fe-S-cluster assembly IscANt SufA IscA, ErpA Isa, Isa2, lba57? - 

NFU-type scaffold for Fe-S-cluster assembly NifU (carboxy-terminal domain)  - NfuA Nful (also presentin plastids) - 

Scaffold for Fe-S-cluster assembly - SufB - - - 

P-loop NTPase scaffold for Fe-S-cluster = = = Indl? Cfd1-Nbp35 

assembly 

Electron transfer NifU (middle domain) = Fdx Yah1-Arh1 = 

Iron donor = = CyaY Yfh1 (frataxin) = 

Transfer of Fe-S cluster from scaffold to - SufC? HscA, HscB Ssql, Jacl, Mgel, Grx5 Nar1, Cia 


target proteins 


P-loop NTPase, phosphate-binding-loop nucleoside triphosphatase. 


containing the active-site cysteine that transiently carries the sulphur 
released from free cysteine as a persulphide*”*. 

In mitochondria, the cysteine desulphurase comprises a complex 
consisting of the IscS-like desulphurase Nfs1 and the 11-kDa protein 
Isd11 (Fig. 2; refs 37, 38). Although isolated Nfs1 contains the enzymatic 
activity as a cysteine desulphurase and releases sulphide from cysteine in 
vitro, the Nfs1-Isd11 complex is the functional entity for sulphur trans- 
fer from Nfs1 to Isul in vivo. This reaction is aided by direct interaction 
between Nfs1 and Isul (IscS and IscU in bacteria). 

On binding of iron to Isul/IscU, the Fe-S cluster is formed by an 
unknown mechanism*”. The iron-binding protein frataxin (Yfh1 in yeast 
and CyaY in bacteria) is believed to function as an iron donor (Box 1) by 
undergoing an iron-stimulated interaction with Isul—Nfs1 (refs 39-42). 
An alternative view recently suggested by in vitro studies is that CyaY 
functions as an iron-dependent regulator of the biosynthesis reaction 
by inhibiting IscS*. Fe-S-cluster assembly on Isul further depends on 
electron transfer from the [2Fe—2S] ferredoxin Yah1 (Fdx in bacteria), 
which receives its electrons from the mitochondrial ferredoxin reductase 
Arhl and NADH” (Fig. 1). It is likely that the electron flow is needed for 
reduction of the sulphan sulphur (S°) present in cysteine to the sulphide 
(S*) present in Fe-S clusters, but this remains to be verified experimen- 
tally. An additional electron requirement was suggested for the fusion of 
two [2Fe-2S] clusters to a [4Fe-4S] cluster by reductive coupling”. 


ISC machinery 
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The second main step of biogenesis formally comprises the release 
of the Fe-S cluster from Isu1/IscU, cluster transfer to apoproteins and 
its assembly into the apoprotein. However, these three partial reactions 
have not been separated experimentally so far. The overall process is spe- 
cifically assisted by a dedicated chaperone system comprising the Hsp70 
ATPase Ssq1 and the DnaJ-like co-chaperone Jacl (respectively HscA 
and HscB in bacteria). In mitochondria, the nucleotide exchange factor 
Mge1 is also required (Fig. 2), whereas in bacteria the related GrpE seems 
to be dispensable owing to the lability of adenosine diphosphate bound 
to HscA’™. Ssq1/HscA undergoes an ATP-hydrolysis-dependent, highly 
specific interaction with the LPPVK motif of Isul/IscU*’. This complex 
formation and the involvement of Jacl/HscB is thought to induce a struc- 
tural change in Isul/IscU, thereby labilizing Fe-S-cluster binding and, 
thus, facilitating cluster dissociation and transfer to apoproteins” ”. An 
ancillary, non-essential role in Fe-S-cluster transfer from Isu1 to apopro- 
teins is performed by the mitochondrial monothiol glutaredoxin Grx5, 
yet its precise function is unknown”. The plant Grx5 proteins were sug- 
gested to serve as scaffolds for the formation of [2Fe-2S] clusters”. 

The aforementioned ISC proteins are required for generation of all 
mitochondrial Fe-S proteins, but some biogenesis components perform 
a more specific function. The interacting mitochondrial proteins Isa1, 
Isa2 and Iba57 (Table 1) are specifically involved in the maturation of a 
subset of Fe-S proteins, that is, members of the aconitase superfamily 


Bacterial 
Fe-S proteins 


Figure 1| A model for Fe-S-protein biogenesis in bacteria: the ISC and SUF 
machineries. For the maturation of housekeeping Fe-S proteins (to become 
the holo form), bacteria use one of two biogenesis systems or both systems 
in parallel. The ISC machinery (left) consists of the cysteine desulphurase 
IscS, which liberates sulphur (yellow circle) from cysteine, generating an 
IscS-bound persulphide (-SSH) on a conserved cysteine residue. A transient 
Fe-S cluster is formed on the scaffold proteins IscU and/or IscA. De novo 
Fe-S-cluster synthesis involves the transfer of iron (red circle) from the 
iron-binding protein CyaY (and other, unknown, factors). The [2Fe-2S] 
ferredoxin (Fdx) possibly serves to reduce the sulphur in cysteine to sulphide. 
The transiently bound Fe-S cluster is then transferred from the scaffolds to 
apoproteins (Apo) for coordination with specific residues (usually cysteine 
or histidine). Transfer from IscU is aided by the dedicated chaperone system 


HscA-HscB, which binds to a conserved motif of IscU (Leu-Pro-Pro-Val- 
Lys). Whereas the ISC machinery is functional under normal conditions in 
Escherichia coli, the SUF machinery (right) is active under oxidative-stress 
and iron-limiting conditions. A SUF system is also functional in the plastids 
of plants. Biogenesis is initiated by the cysteine desulphurase SufS, which 
functions comparably to IscS except that the sulphur is first transferred to 

a conserved cysteine residue of SufE and bound as a persulphide. Putative 
iron and electron donors in this system are still unknown. Three components 
might be scaffolds for de novo Fe-S-cluster assembly: SufU and SufA, which 
show similarities to IscU and IscA, respectively; and SufB, which forms a 
stable complex with SufC-SufD. SufC is an ATPase and may facilitate Fe-S- 
cluster dissociation from SufB and subsequent transfer to apoproteins. The 
bold numbers refer to the biogenesis steps defined in Box 1. 
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Figure 2 | A model for Fe-S-protein assembly in mitochondria. 
Mitochondria import ferrous iron (Fe”*) from the cytosol in a membrane- 
potential-dependent manner (with the proton-motive force (PMF) asa 
source of energy). Importation is aided by the inner membrane carriers 
Mrs3 and Mrs4 (known as mitoferrin in mammals). The maturation 

of mitochondrial Fe-S proteins (to the holo form) starts with sulphur 
liberation from cysteine by the cysteine desulphurase complex Nfs1-Isd11. 
The synthesis of a transiently bound Fe-S cluster on the scaffold protein 
Isul (and Isu2 in yeast) further depends on the iron-binding protein 
frataxin (Yfh1 in yeast) as an iron donor and the electron-transport chain 
consisting of NADH, ferredoxin reductase (Arh1) and ferredoxin (Yah1), 
which possibly provides electrons for the reduction of sulphur to sulphide. 
The release of the Fe-S cluster from Isu1, and its transfer and incorporation 
into recipient apoproteins (Apo) are facilitated by the ATP-dependent 
Hsp70 chaperone Ssq1, the DnaJ-like co-chaperone Jacl, the nucleotide- 
exchange factor Mge1 and the monothiol glutaredoxin Grx5. Proteins of 
the aconitase family and radical SAM proteins such as biotin synthase 
specifically need Isa1, Isa2 and Iba57 in addition for maturation of their 
Fe-S clusters. Assembly of respiratory complex I also requires the P-loop 
NTPase Ind1. The roles of Nful and GTP are still unclear*’”’. The bold 
numbers refer to the biogenesis steps defined in Box 1. 


and radical SAM proteins” (Fig. 2). Depletion of these proteins results 
in corresponding enzyme defects and auxotrophies. Similarly, a defi- 
ciency of the Isa-protein-related IscA in bacteria, in conjunction with 
the homologous SufA (see below; Table 1), affects the assembly of the 
[4Fe-4S] proteins aconitase and dihydroxy-acid dehydratase, whereas 
the maturation of some [2Fe-2S] proteins such as ferredoxin is unal- 
tered”. The third bacterial member of this protein class, ErpA (Table 1), 
is essential for growth and involved in the maturation of an Fe-S protein 
of isoprenoid biosynthesis”’. Several members of the Isal/IscA protein 
family (Table 1) were shown in vitro to bind an Fe-S cluster by means 
of three conserved cysteine residues in two motifs characterizing these 
proteins**"”°, SufA binds a [2Fe-2S] cluster in vivo that can be trans- 
ferred to both [2Fe-2S] and [4Fe-4S] proteins in vitro”. Together, these 
observations may support the view that the Isal/IscA proteins function 
as alternative scaffolds for a subset of Fe-S proteins (Fig. 1). However, 
the relative specificity of the Isul1/IscU and Isal/IscA scaffolds and their 
functional cooperation will require further scrutiny in vivo to test the 
physiological relevance of this proposal, particularly because IscA was 
also shown to bind mononuclear iron’. 

The mitochondrial P-loop NTPase Ind1 is important for the assembly 
of respiratory complex I (ref. 55) (Fig. 2). On the basis of its homology 
with the cytosolic scaffold-protein complex Cfd1-Nbp35 (see below; 
Table 1), it was proposed that Ind1 serves as a specific scaffold or transfer 
protein for the assembly of the eight Fe-S clusters into complex I. Con- 
sistent with this idea, Ind1 was shown to assemble a labile Fe-S cluster 
that can be passed on to apoproteins in vitro. 


The SUF machinery in bacteria and plastids 

Deletion of the isc operon from E. coli is not associated with a major 
phenotype. Cell viability is affected only when the SUF biogenesis sys- 
tem is simultaneously inactivated””’. The suf genes are organized in an 
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operon that is induced under iron-limiting and oxidative-stress condi- 
tions””® (Table 1). Gene expression from the isc and suf operons is coor- 
dinately regulated by the Fe-S proteins IscR and SufR, which function 
as transcriptional repressors of their respective operons’. During iron 
deficiency or oxidative stress, the apo form of IscR additionally activates 
the suf operon. Thereby, both proteins link the efficiency of Fe-S-protein 
maturation to the extent of gene expression of the two operons. 

Components of the SUF machinery are found in a variety of prokary- 
otes, including Archaea and photosynthetic bacteria. The various SUF 
components fulfil some of the biosynthetic conditions of Fe-S-protein 
biogenesis (Box 1). A complex of SufS and SufE serves as the cysteine 
desulphurase (Fig. 1), in which SufS acts similarly to bacterial IscS or 
NifS and mitochondrial Nfs1-Isd11, but functions mechanistically dis- 
tinctly*. SufE stimulates SufS activity more than tenfold and allows 
the cysteine-bound persulphide intermediate on SufS to be transferred 
to a conserved cysteine residue on SufE, from where it is passed on to 
scaffold proteins*’. Unexpectedly, SufE has a structure similar to the 
IscU-type scaffold proteins, but it is not known to function as one”. 
A specific iron donor and an electron requirement (Box 1) in the SUF 
system are not yet known, but corresponding steps are probably also 
involved in this pathway. Several SUF proteins may provide a scaffold 
function for de novo Fe-S-cluster synthesis, but their relative impor- 
tance and specificity remain to be clarified (Fig. 1). SufA was discussed 
above as a functional IscA homologue. SufB contains several conserved 
cysteine residues that can assemble an Fe-S cluster”’”. SufC is an ATPase 
that is stimulated 100-fold by complex formation with SufB-SufD*™. 
Hence, SufC is a likely candidate for a transfer protein facilitating Fe-S- 
cluster delivery from SufB to target proteins (Box 1). Some bacteria 
contain an IscU-related protein termed SufU that may or may not be 
encoded in the suf operon. Notably, SufU differs from Isul/IscU in 
that it lacks the HscA binding sequence LPPVK of IscU. 

SUF proteins are also present in plastids, reiterating that this biosyn- 
thetic system seems to be less sensitive to high oxygen concentrations. 
The functionality of plastid SufS, SufE and SufA has been confirmed by 
in vitro experiments or bacterial complementation studies, but direct 
experimental evidence for their biogenesis function in planta is usually 
more difficult to achieve''”. It should be mentioned in this context that 
in plastids the SUF proteins may not be the only proteins to support Fe-S- 
protein biogenesis’. An important role, possibly as scaffold proteins, is 
performed by NFU1, NFU2 and NFU3 (also known as Cnful, Cnfu2 and 
Cnfu3), which have homologues in photosynthetic bacteria. NFU pro- 
teins show sequence similarity in a 60-residue segment to the C-terminal 
domain of NifU in bacteria and a similar segment present in Nful in 
yeast, the function of which is unknown (Fig. 2). In particular, plastid 
NFU2 has been examined in more detail and shown to function as a 
scaffold that can assemble a [2Fe—2S] cluster in vitro and transfer it to 
apoferredoxin™. The cnfu2 mutant plants show a dwarf phenotype with 
faint pale-green leaves and a deficiency in photosystem I and ferredoxins 
documenting the important role of NFU2 in Fe-S-protein assembly. 


Biogenesis of cytosolic and nuclear Fe-S proteins 

Fe-S-protein maturation in both the cytosol and the nucleus strictly 
depends on the function of the mitochondrial ISC assembly machin- 
ery” (Fig, 3), but the molecular details of this dependence remain to 
be defined. In human cell culture, small amounts of some ISC proteins 
have been found in the cytosol’”””. A function for the cytosolic human 
homologue of Isu1 in de novo assembly of cytosolic Fe-S proteins could 
not be shown, but the protein may play a role in Fe-S-cluster repair after 
oxidative damage or iron deprivation. Likewise, cytosolic human Nfs1 
does not support Fe-S-protein assembly in the cytosol in the absence of 
mitochondrial Nfs1 (ref. 72). The mitochondria-localized ISC assembly 
machinery is suggested to produce a (still unknown) component (X in 
Fig. 3) that is exported from the mitochondrial matrix to the cytosol, 
where it performs an essential function in the maturation process. 
Because, in particular, Nfs1 is required inside mitochondria to par- 
ticipate in cytosolic and nuclear Fe-S-protein biogenesis in both yeast 
and human cells, compound X is predicted to be a sulphur-containing 
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moiety”. Whether iron is also exported, possibly as part of a pre- 
assembled Fe-S cluster, or joins from the cytosol, is currently unknown. 
The export reaction is accomplished by the ABC transporter Atm1 
(ABCB7 in humans) of the mitochondrial inner membrane”*”*”*. 

Another required component of the export reaction is the sulphydryl 
oxidase Erv1, located in the intermembrane space. This enzyme has 
also been shown to catalyse the formation of disulphide bridges in the 
intermembrane space during Mia40-dependent protein import into the 
intermembrane space”, and thus performs a dual function. Strikingly, 
depletion of GSH in yeast shows a similar phenotype as the downregula- 
tion of Atm] or Erv1, that is, defective cytosolic Fe-S-protein biogenesis 
and increased iron uptake in the cell and mitochondria (see below), 
whereas the assembly of mitochondrial Fe-S proteins is unaffected. 
Hence, Atm1, Ervl and GSH have been described as the ‘ISC export 
machinery’ (Fig. 3). 

Maturation of cytosolic and nuclear Fe-S proteins crucially involves 
the cytosolic Fe-S-protein assembly (CIA) machinery, which comprises 
five known proteins (Table 1). According to recent in vivo and in vitro 
studies, this process can be subdivided into two main partial reactions” 
(Fig. 3). First, an Fe-S cluster is transiently assembled on the P-loop 
NTPases Cfd1 and Nbp35 (refs 77, 78), which form a heterotetrameric 
complex and serve as a scaffold (Box 1). As mentioned above, this step 
essentially requires the mitochondrial ISC machineries. From Cfd1- 
Nbp35, the Fe-S cluster is transferred to apoproteins, a step that requires 
the CIA proteins Nar] and Cial. Cfd1 and Nbp35 take part in the matu- 
ration of Nar] by assisting the assembly of two Fe-S clusters on this iron- 
only hydrogenase-like protein (Fig. 3). Thus, Nar1 is both a target and 
a component of the CIA machinery, creating a ‘chicken-and-egg’ situ- 
ation for its maturation process. Nar1 holoprotein assists Fe-S-cluster 
transfer to target apoproteins by interacting with Cial, a WD40 repeat 
protein that serves as a docking platform for binding Nar] (ref. 79). 
Recently, another CIA component, termed Dre?2, has been identified 
but its precise molecular function is currently unknown™. The protein 
coordinates Fe-S clusters, and is probably both a target and a component 
of the CIA machinery, similar to Nar1. A crucial function of the human 
homologues of Nar1 and Nbp35 in cytosolic Fe-S-protein biogenesis has 
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Figure 3 | The roles of mitochondria and the CIA machinery in Fe-S-protein 
biogenesis in the cytosol and nucleus of eukaryotes. Cytosolic and nuclear 
Fe-S-protein biogenesis requires both the mitochondrial ISC assembly and 
export machineries. The ABC transporter Atm1 of the mitochondrial inner 
membrane exports an unknown compound (X) to the cytosol for use in 
Fe-S-protein assembly, and is assisted by the tripeptide glutathione (GSH) 
and the intermembrane-space sulphydryl oxidase Erv1, which introduces 
disulphide bridges into substrates. In the cytosol, the components of the 
CIA machinery catalyse Fe-S-protein maturation in two main steps. First, 
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been experimentally verified in cultured cells using RNA-interference 
technology to deplete these proteins to critical levels*"”. 


The essentiality of Fe-S-protein biogenesis in eukaryotes 
Most functional studies of eukaryotic Fe-S-protein biogenesis have 
been performed in yeast and practically all of the ISC and CIA compo- 
nents have been first discovered and characterized in this model sys- 
tem. Nevertheless, the process seems to be highly conserved in virtually 
all eukaryotes. This notion has now been documented in functional 
studies in human cell culture, mouse, zebrafish and Drosophila mod- 
els for central components of the ISC and CIA machineries, such as 
Nfs1, Isul, frataxin, Grx5, Atm1, Nbp35 and Nar! (ref. 8). It is likely 
that the remaining components also have functions similar to those 
in yeast. Nevertheless, gene duplications of some of the ISC and CIA 
genes in vertebrates indicate that the process may be more complex 
in such eukaryotes or that the additional components perform other 
(non-Fe-S-related) tasks. 

Most of the ISC and CIA components are essential for the viability of 
yeast and human cells’’”**’. In fact, Fe-S-protein biogenesis is the only 
known function of mitochondria that is indispensable for the viability of 
yeast cells. In comparison, yeast mutants in oxidative phosphorylation 
result in a respiratory growth defect, but they can grow on glucose- 
containing media. Because yeast mitochondria are not known to con- 
tain essential Fe-S proteins (apart from the ferredoxin Yah1, which is 
involved in Fe-S-protein biogenesis), the essential character of the mito- 
chondrial ISC assembly machinery is explained by its role in the matura- 
tion of extra-mitochondrial Fe-S proteins. The first-known example of 
an essential cytosolic—nuclear Fe-S protein is the ABC protein Rlil, an 
Fe-S component involved in ribosome assembly and export from the 
nucleus’®. Maturation of its Fe—S clusters strictly depends on the two 
mitochondrial ISC systems and the CIA machinery providing a tight 
link between mitochondrial function and cytosolic protein translation. 
Recently, two other essential (nuclear) Fe—S proteins were identified, 
one (Rad3) with a function in nucleotide excision repair and the other 
(Pri2) with a function in RNA primer synthesis for DNA replication’*™. 
It seems probable that their maturation requires mitochondria, possibly 
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Fe-S clusters are assembled on the P-loop NTPase complex Cfd1-Nbp35. 
The source of iron and the potential need for electron transfer remain to be 
elucidated. The Fe-S clusters bound to Cfd1-Nbp35 are labile, and, aided by 
the iron-only hydrogenase-like protein Nar1 and the WD40-repeat protein 
Cia1, can be transferred to cytosolic and nuclear apoproteins (Apo). The 

site of involvement of the Fe-S-cluster-containing CIA protein Dre?2 is still 
unknown. Maturation of the two Fe-S clusters on Nar1 (and presumably 

on Dre2) depends on mitochondria and Cfd1-Nbp35 function. The bold 
numbers refer to the biogenesis steps defined in Box 1. 
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Table 2 | Diseases related to Fe-S proteins and their biogenesis 


Human protein Yeasthomologue Function 


Associated disease 


Biogenesis components 


Mitoferrin Mrs3, Mrs4 Mitochondrial carrier; putative iron transporter Erythropoietic protoporphyria”’ 
ISCU Isul Mitochondrial scaffold Myopathy with exercise intolerance’"”” 
Frataxin Yfh1 Iron donor for Isul scaffold Friedreich's ataxia”® 
GLRX5 Grx5 Monothiol glutaredoxin; transfer of Fe-S clusters from Isu1 to apoproteins Microcytic anaemia®” 
ADR Arhl Adrenodoxin reductase; electron transfer from NADH to adrenodoxin; Tumour suppressor 
also involved in steroid hormone biosynthesis 
ABCB7 Atm] ABC transporter, ISC export machinery component X-linked sideroblastic anaemia and cerebellar ataxia” 
Fe-S proteins 
Complex | n.p. Complex | of respiratory chain (NADH:ubiquinone Various mitochondrial diseases such as LHON, MELAS, 
oxidoreductase) Leigh syndrome 
Complex II Complex II Complex II of respiratory chain (succinate dehydrogenase) Tumour suppressor 
(Sdh1, Sdh2, 
Sdh3, Sdh4) 
XPD Rad3 Fe-S protein involved in nucleotide excision repair Xeroderma pigmentosum, Cockayne syndrome, 
trichothiodystrophy 
FANCJ Rad3 Fe-S protein involved in nucleotide excision repair Fanconi anaemia 
MUTYH Ntg2 Fe-S protein (glycosylase) involved in DNA repair Colon cancer 


n.p., not present in S. cerevisiae. 


linking these organelles to other fundamental processes of life. This 
indicates that mitochondria and the process of Fe-S-protein maturation 
have been maintained in evolution to mature extra-mitochondrial Fe-S 
proteins with an essential function in gene expression. This intimate 
connection between the endosymbiontic host and the bacterial ancestor 
of mitochondria may have been decisive for the maintenance of these 
organelles in eukaryotes. 

The crucial role of mitochondria in Fe-S-protein biogenesis has raised 
the pressing question of how ‘amitochondriate’ organisms lacking clas- 
sical mitochondria (such as Giardia, Microsporidia and Entamoeba) 
assemble their Fe-S proteins. Genomic information on several of these 
organisms has provided evidence for the presence of homologues of 
the ISC assembly and CIA machineries, whereas genes for classical 
functions of mitochondria, such as respiration, haem biosynthesis 
or the citric-acid cycle, are missing. Cell-biological studies of several 
amitochondriate species have shown that the organisms harbour small 
double-membrane-bounded organelles, termed mitosomes, that con- 
tain homologues of the mitochondrial chaperones Hsp60 and Hsp70 
as marker proteins****. It is now accepted that mitosomes descended 
from classical mitochondria by reductive evolution, that is, by succes- 
sively losing functions that became dispensable owing to the usually 
intracellular lifestyle of these parasites. Localization and functional stud- 
ies have shown that mitosomes from Giardia and Microsporidia in fact 
contain a functional set of ISC assembly proteins*”™*. Fe-S-protein bio- 
genesis therefore seems to be the remnant function of these organelles, 
explaining their maintenance in these species. This, however, does not 
readily clarify why the process of Fe-S-protein biogenesis still depends 
on mitosomes, that is, why the pathway still is compartmentalized and 
apparently cannot normally be taken over by the cytosol. 

The central importance of Fe-S-protein biogenesis in mammals is 
impressively documented in numerous diseases associated with defects 
in Fe-S-protein biogenesis components or Fe-S proteins (Table 2). 
Depletion of frataxin, the putative iron donor for Fe-S-cluster forma- 
tion on Isul, causes the neurodegenerative disease Friedreich’s ataxia, 
which is associated with Fe-S-protein activity defects and iron accu- 
mulation. Defects in the glutaredoxin Grx5 and in mitoferrin (Fig. 2) 
are associated with diseases exhibiting mainly haematological pheno- 
types, namely microcytic anaemia and erythropoietic protoporphyria, 
respectively”’”’. This surprising connection between Fe-S-protein 
biogenesis and haematopoiesis was experimentally explained by the 
fact that Fe-S-protein biogenesis is crucial for the maturation of IRP1, 
which regulates the synthesis of several proteins involved in iron uptake, 
storage and utilization (see above). One such example is the erythroid 
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6-aminolaevulinate synthase (ALAS2), catalysing the committed step 
of haem biosynthesis in mitochondria. Translation of this protein in 
the cytosol is largely decreased when iron is scarce or Fe-S-protein bio- 
genesis is hampered, intimately integrating the efficiency of haem and 
Fe-S-protein biosynthesis, the two major iron-consuming processes 
in the cell. An additional connection between haem and Fe-S-protein 
biosynthesis may be the presence of an Fe-S cluster in mammalian fer- 
rochelatase, the final enzyme of haem biosynthesis. A splicing defect in 
the JSCU gene in humans results in a myopathy with exercise intolerance 
and intracellular iron overload’”’. Some Fe-S proteins appear to be con- 
nected to human disease (Table 2). Succinate dehydrogenase (complex II 
of the respiratory chain) and adrenodoxin reductase (an ISC member, 
but not an Fe-S protein) have been described as tumour suppressors’. 
Mutations in the putative human Fe-S proteins XPD and FANCJ cause 
multiple disease phenotypes including xeroderma pigmentosum and 
Fanconi anaemia’. 


Impact of Fe-S-protein biogenesis on other cellular processes 
Defects in Fe-S-protein biogenesis in both bacteria and eukaryotes have 
a severe impact on several other cellular processes. On the one hand, this 
connection is explained by the function of particular Fe-S proteins in 
pathways such as amino-acid biosynthesis, the citric-acid cycle, respira- 
tion, cofactor biosyntheses (haem, biotin, lipoic acid and molybdenum 
cofactor) and gene expression (DNA replication, DNA repair and trans- 
lation). On the other hand, some ISC components perform a second 
(moonlighting) function in other cellular processes. For instance, thio 
modification of transfer RNAs requires IscS in bacteria and mitochon- 
drial Nfs1—Isd11 asa sulphur donor*”*™*. Thio and other modifications 
of tRNAs depend on the function of Fe-S proteins both in bacteria (for 
example, MiaB) and eukaryotes (for example, Elp3). Furthermore, the 
mitochondrial [2Fe-2S] ferredoxin Yah1, together with its reductase 
Arh] and Cox15, is required for the hydroxylation of protohaem for the 
synthesis of haem A of cytochrome oxidase. The mammalian homologue 
of Yah1, adrenodoxin, plays a long-known role in steroid biosynthesis. 
An intimate connection exists between Fe-S-protein biogenesis in 
mitochondria and iron regulation in the cell. The mechanisms of this 
regulatory pathway are fundamentally different in yeast and mammalian 
cells, yet the input (iron requirement for Fe-S-protein biogenesis) and 
the outcome (iron-uptake regulation) of the regulatory chain are similar. 
In yeast, iron uptake and intracellular distribution are mainly regulated 
by a transcriptional mechanism involving the transcription factors Aft1 
and Aft2 (ref. 95). In the absence of iron, Aft] and Aft2 translocate to 
the nucleus, where they activate genes of the iron regulon including iron 
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transporters, siderophore transporters and proteins for intracellular iron 
distribution. Low iron concentrations inside mitochondria or defects in 
mitochondrial Fe-S-protein biogenesis (that is, ISC protein deficiencies) 
activate Aftl—Aft2 and lead to a concomitant iron accumulation in mito- 
chondria. Thus, mitochondria have a crucial role in regulating cellular iron 
homeostasis. The regulatory mechanism includes both the ISC assembly 
and ISC export systems, suggesting that the same ora similar molecule as 
that exported by Atm] to the CIA machinery may communicate the iron 
status of the mitochondrial matrix to Aftl and Aft2 (ref. 96). Strikingly, 
the yeast CIA proteins have no major role in iron sensing and regulation, 
refuting the idea that a canonical cytosolic or nuclear Fe-S protein may be 
responsible for the regulatory function in iron homeostasis. 

In mammals, cellular iron regulation occurs predominantly at the 
post-transcriptional level and involves both the Fe-S protein IRP1 (see 
above) and iron-regulatory protein 2, which has no Fe-S cluster yet is 
degraded under iron-replete conditions". Insertion of the Fe-S cluster 
into IRP1 has been shown to require mitochondrial Nfs1, Isu1, frataxin, 
Grx5, ABCB7, Nbp35 and Narl (refs 71-74,81,82,89,97). Hence, in 
mammals, both the ISC and the CIA machineries are critical for the 
regulation of cellular iron uptake, creating the notable difference from 
yeast, in which the CIA machinery is not needed. 


Outlook 

Despite the impressive progress in research on Fe-S-protein biogenesis 
during the past decade, exciting years of further exploration are ahead 
of us, ranging from basic structural, biochemical and cell-biological 
research to physiology and molecular medicine. Only some of the most 
pressing problems to be solved in future research can be mentioned here. 
The solving of three-dimensional structures of the various ISC and CIA 
proteins, in particular of the holo forms of different Fe-S scaffolds, will 
facilitate the elucidation of the molecular mechanisms of de novo Fe-S- 
cluster assembly in living cells. It will be crucial to combine in vitro and 
in vivo approaches to verify the physiological relevance of findings with 
isolated proteins. The specificity of, and differing needs for, the various 
scaffold proteins has to be clarified to understand their individual roles. 
The mechanisms of Fe-S-cluster transfer from the scaffold proteins must 
be dissected into partial steps, and the precise roles of the transfer-assist- 
ing proteins will have to be understood. It will be important to elucidate 
the conversion of [2Fe-2S] clusters to [4Fe—4S] clusters and define the 
function of participating biogenesis components. 

Novel biogenesis components, in particular the iron and electron 
donors of the SUF and CIA machineries, will be identified and their func- 
tion characterized. A particularly pressing question is the isolation and 
chemical characterization of the substrate exported from mitochondria 
to assist extra-mitochondrial Fe-S-protein biogenesis. This achievement 
will crucially depend on the development of an in vitro reconstitution 
protocol for the export process. It is possible that new moonlighting func- 
tions of Fe-S-protein biogenesis components will be identified. In that 
respect, it will be interesting to clarify the function of the nuclear form of 
Nfs1. Advances in isolating mitosomes will make it possible to test their 
suggested function in cellular Fe-S-protein biogenesis experimentally. 
Finally, new diseases associated with this fundamental process of life will 
allow us to understand the physiological consequences of Fe-S-protein 
defects. It is clear that thorough molecular insights into the mechanisms 
of Fe-S-protein biogenesis are prerequisite for the future development 
of therapeutic strategies in the treatment of Fe-S diseases. a 
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Molybdenum cofactors, enzymes 


and pathways 


Giinter Schwarz’, Ralf R. Mendel? & Markus W. Ribbe?® 


The trace element molybdenum is essential for nearly all organisms and forms the catalytic centre of alarge 
variety of enzymes such as nitrogenase, nitrate reductases, sulphite oxidase and xanthine oxidoreductases. 
Nature has developed two scaffolds holding molybdenum in place, the iron-molybdenum cofactor and pterin- 
based molybdenum cofactors. Despite the different structures and functions of molybdenum-dependent 
enzymes, there are important similarities, which we highlight here. The biosynthetic pathways leading to 
both types of cofactor have common mechanistic aspects relating to scaffold formation, metal activation 

and cofactor insertion into apoenzymes, and have served as an evolutionary ‘toolbox’ to mediate additional 


cellular functions in eukaryotic metabolism. 


Molybdenum is bioavailable as molybdate (MoO,” ). Once molybdate 
enters the cell, it is subsequently incorporated by complex biosynthetic 
machineries into metal cofactors”. All enzymes that depend on molyb- 
denum catalyse redox reactions by taking advantage of the versatile 
redox chemistry of the metal, which is controlled by the cofactor itself 
and the enzyme environment’. Within the enzyme, molybdenum shut- 
tles between three oxidation states (+4, +5 and +6), thereby catalysing 
two-electron reduction—oxidation (redox) reactions. In most cases, 
regeneration of the active site involves single-electron steps, resulting 
in a paramagnetic molybdenum intermediate. Molybdenum enzymes 
are found in nearly all organisms, with Saccharomyces as a prominent 
eukaryotic exception*. Many anaerobic archaea and some bacteria are 
molybdenum independent but require tungsten for their growth. Tung- 
state, which is 100-fold less abundant than molybdate, is enriched in 
deep-sea hydrothermal vents, reflecting conditions on the primitive 
Earth. Many of the known tungsten-dependent hyperthermophilic 
bacteria and archaea are found in such vents”. 

In nature, two very different systems have developed to control the 
redox state and catalytic power of molybdenum, which functions as an 
efficient catalyst in oxygen-transfer reactions. In either case, at least three 
sulphur and two oxygen atoms form ligands to molybdenum (Fig. 1). 
One type of molybdenum cofactor is the iron-sulphur-cluster-based 
iron—-molybdenum cofactor (FeMo-co) that is unique to the molyb- 
denum nitrogenase’, with one [4Fe-3S] and one [Mo-3Fe-3S] partial 
cubane bridged by three sulphides and one 1, central atom, X (which 
may be carbon, oxygen or nitrogen)’. The molybdenum of FeMo-co is 
further coordinated by homocitrate (Fig. 1a). The core structure of the 
other type of molybdenum cofactor (Moco) is a pterin-based cofactor 
(molybdopterin or metal-binding pterin (MPT)), with a C6-substituted 
pyrano ring, a terminal phosphate and a unique dithiolate group binding 
molybdenum*. The metal can be attached to one or two pterin moieties 
with additional terminal oxygen and sulphur ligands (Fig. 1c, d). Both 
cofactors are oxygen sensitive and very unstable outside their respective 
apoenzymes. 

In this Review, we give a short overview of the different families 
of molybdenum-containing enzymes, focusing on the biosynthetic 


machineries that lead to the ‘biological’ activation of the metal in dif- 
ferent molybdenum cofactors. 


Molybdenum-dependent enzymes 

On the basis of cofactor composition and catalytic function, molybde- 
num-dependent enzymes can be grouped into two categories: bacterial 
nitrogenases containing an FeMo-co in the active site, and pterin-based 
molybdenum enzymes. The second category is divided into three fami- 
lies, exemplified by sulphite oxidase, xanthine oxidase and dimethyl 
sulphoxide reductase (DMSOR), which each have a distinct active- 
site structure’ (Fig. 1). Tungsten-dependent formate dehydrogenase is 
classified as part of the DMSOR family, whereas aldehyde:ferredoxin 
oxidoreductases form a separate family of tungsten-cofactor (W-co) 
containing enzymes found only in archaea. The biochemistry of W-co 
enzymes has been summarized very recently”. 


Molybdenum nitrogenase 

Nitrogenases provide the biochemical machinery for nucleotide-depend- 
ent reduction of dinitrogen (N,) to ammonia (NH,)°. The overall reaction 
catalysed by nitrogenases is usually depicted as N, + 8H" + 16MgATP + 
8e > 2NH,+ H,+ 16MgADP + 16P,, where P; denotes an inorganic 
phosphate. This reaction not only represents a major entry point of 
reduced nitrogen into the global nitrogen cycle, but also embodies the 
complex chemistry of breaking the triple bond of N, under ambient 
conditions (see page 814). Three homologous nitrogenase systems have 
been identified so far’. The best-characterized molybdenum nitrogenase 
is a binary enzyme system comprising two redox-active metalloproteins 
(Fig. 2a). One, designated Fe-protein, is an a, homodimer with one 
[4Fe—4S] cluster bridged between the subunits and one MgATP-bind- 
ing site located in each subunit; the other, termed MoFe-protein, is an 
a,$, heterotetramer containing two unique metal centres: the P-cluster, 
an [8Fe-7S] cluster that is ligated between each af-subunit dimer; and 
the FeMo-co, a [Mo-7Fe-9S—X-homocitrate] cluster (where X may 
be carbon, oxygen or nitrogen) that is buried within each a-subunit””” 
(Fig. 1a). Catalysis by molybdenum nitrogenase probably involves the 
repeated association and dissociation of Fe-protein and MoFe-protein 
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Figure 1| Molybdenum-containing cofactors. Chemical and three- 
dimensional structures of FeMo-co (a), Mo-bis-MGD (b) and the Moco 
from enzymes of the sulphite oxidase (c) and xanthine oxidase (d) families. 
The three-dimensional structures were taken from the crystal structures 


and ATP-dependent transfer of electrons from the [4Fe-4S] cluster of Fe- 
protein, through the P-cluster, to the FeMo-co of MoFe-protein, where 
substrate reduction eventually takes place®. 


Molybdenum enzymes with a pterin cofactor 

More than 50 different pterin-containing molybdenum enzymes 
are known and classified on the basis of the coordination chemistry 
of molybdenum in their active site'’"’. All eukaryotic molybdenum 
enzymes belong exclusively to either the sulphite oxidase or the xan- 
thine oxidase family with a simple MPT-type cofactor (Fig. 1c, d). They 
differ in the nature of the third Mo-S ligand, which is either provided 
by an enzyme-derived cysteine (sulphite oxidase) or as a terminal sul- 
phido ligand (xanthine oxidase). By contrast, members of the DMSOR 
family coordinate molybdenum by two pterin moieties, each carrying 
in addition guanosine monophosphate, which together form a Mo-bis- 
MPT guanine dinucleotide cofactor (Mo-bis-MGD; Fig. 1b). Detailed 
mechanistic aspects of all these enzyme families have been discussed 
elsewhere*’?"™*. 

Members of the DMSOR family are very diverse in reaction, func- 
tion and structure''. Whereas DMSOR of Rhodobacter sphaeroides is 
monomeric and lacks other redox centres, DMSOR from Escherichia 
coli isa membrane-bound enzyme composed of three subunits (DmsA, 
DmsB and DmsC) with high similarity to E. coli dissimilatory nitrate 
reductase” (Fig. 2b). Both represent complex members of this fam- 
ily and additionally harbour a [3Fe-4S] cluster in the catalytic Mo- 
bis- MGD-containing subunit (NarG), four [4Fe—4S] clusters in the 
electron-transfer subunit (NarH) and a pair of type-b cytochromes in 
the membrane-spanning subunit (Narl), together forming a dimer of 
trimers’*. By contrast with the tricyclic pyranopterin structure of Moco 
in all known molybdenum and tungsten enzymes, NarG-NarH-NarI 
shows the presence ofan open bicyclic structure in one of the pterins””, 
suggesting a reversible cyclization during catalysis. Most members of 
the DMSOR family function under anaerobic conditions whereby their 
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of nitrogenase’, NarG-NarH-Narl”, Pichia pastoris nitrate reductase” 
and bovine xanthine oxidase”. Structures are shown in ball-and-stick 
presentation (Mo, cyan; C, grey; O, red; S, yellow; Fe, brown; N, blue) and 
rendered with PYMOL (http://www.pymol.org/). 


respective cofactors serve as terminal electron acceptors in respiratory 
metabolism. 

Animal sulphite oxidase and eukaryotic nitrate reductase are dimeric 
enzymes forming another family of molybdenum enzymes with a high 
degree of structural conservation’*””, In addition to the molybdenum 
domain, they both harbour a cytochrome-b,-type haem domain that 
either receives electrons from the molybdenum centre (sulphite oxidase; 
Fig. 2c) or donates electrons to the molybdenum site (nitrate reductase). 
However, animal sulphite oxidase shuttles electrons derived from sul- 
phite oxidation towards cytochrome c, whereas nitrate reductase receives 
electrons needed for nitrate reduction from NADPH. The catalytic cycle 
of sulphite oxidase involves sulphite oxidation coupled to molybdenum 
reduction, followed by two individual electron-transfer steps through 
the cytochrome b; domain to cytochrome c, a process associated with 
large spatial movements of the sulphite oxidase haem domain”. Sulphite 
oxidases and homologous enzymes are also found in plants and bacteria, 
where they either form a homodimer of two molybdenum subunits 
lacking the haem domain (plants)'*"’ or assemble into a heterodimer 
ofa single molybdenum- and cytochrome c-containing subunit (bacte- 
ria)”. Sulphite dehydrogenase from Starkeya novella is localized in the 
periplasm, whereas plant sulphite oxidase functions in peroxisomes, 
generating H,O, on transfer of electrons to dioxygen”. Plant, fungal and 
algal nitrate reductases are localized in the cytosol and provide the sec- 
ond major entry point for nitrogen into the living world. By comparison 
with sulphite oxidase, nitrate reductase contains an additional carboxy- 
terminal FAD domain where either NADH (plant and algae) or NADPH 
(fungi) provides reducing equivalents for nitrate reduction”. 

All members of the xanthine oxidase family (also termed molybdenum 
hydroxylases) catalyse hydroxylations of carbon centres (in aldehydes 
and aromatic heterocycles) using oxygen derived from water’”. They are 
molybdo-flavoenzymes forming homodimers in eukaryotes with three 
distinct domains in each subunit (Fig. 2d). Xanthine oxidase is the key 
enzyme in purine degradation, catalysing oxidation of hypoxanthine via 
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xanthine to uric acid. Electrons derived from substrate hydroxylation are 
transferred through [2Fe-2S] clusters and FAD to either molecular oxygen, 
yielding superoxide anions (xanthine oxidase)” or NAD* (xanthine dehy- 
drogenase)”. In bacteria, such as Rhodobacter capsulatus, xanthine dehy- 
drogenase assembles from two different subunits, harbouring the Moco 
on one subunit and the iron-containing and FAD-containing domains 
on the other subunit and showing a remarkable structural and functional 
similarity to the bovine enzyme™”>. Eukaryotic aldehyde oxidases, which 
are derived from ancient gene duplications of xanthine oxidase, convert 
a wide range of aromatic and non-aromatic aldehydes and function in 
detoxification (animals)”° and hormone synthesis (abscisic acid, plants)”. 
Bacterial members of the xanthine oxidase family contain a mono-MPT 
cytosine dinucleotide cofactor and include aldehyde oxidoreductases” 
and carbon monoxide dehydrogenase”, the latter being characterized by 
a special dinuclear Cu-S—Mo centre. Apart from substrate-dependent 
production of reactive oxygen species, xanthine oxidase and aldehyde 
oxidases show NADH oxidase activity with simultaneous production of 
superoxide”, which has a number of proposed physiological functions in 
the metabolism of reactive oxygen species during stress response”. 

Recently a new mitochondria-associated molybdenum enzyme was 
found in mammals” that promotes reduction of N-hydroxylated ami- 
dines in concert with cytochrome b, and cytochrome b; reductase, a 
reaction that may be associated with cellular detoxification. It remains 
unclear what molybdenum-enzyme family this enzyme belongs to, but 
its occurrence is widespread, homologues having been found among 
plants and bacteria. 


Biosynthesis of FeMo-co 

Assembly of nitrogenase FeMo-co is a considerable chemical feat because 
of its complexity and intricacy. Recent progress in the chemical synthesis 
of FeMo-co analogues has provided significant insights into this process”. 
Elucidation of the biosynthesis of FeMo-co, on the other hand, is further 
complicated by the large ensemble of participating gene products’”’. The 
exact functions of these gene products and the precise sequence of events 
in FeMo-co assembly have remained unclear until recently, when the 
characterization of a number of assembly-related intermediates afforded 
a better understanding of this biosynthetic ‘black box’ (Fig. 3). 


Formation of the Fe-S core of FeMo-co 

Assembly of FeMo-co is probably initiated by NifU and NifS, which 
mobilize iron and sulphur for the assembly of small Fe-S fragments 
(see page 831). NifS is a pyridoxal phosphate-dependent cysteine desul- 
phurase and is responsible for the formation of a protein-bound cysteine 
persulphide that is subsequently donated to NifU for the sequential for- 
mation of [2Fe-2S] and [4Fe-4S] clusters**** (Fig. 3). These small Fe-S 
clusters are then transferred to NifB and further processed into a large 
Fe-S core that possibly contains all the iron and sulphur necessary for 
the generation of a mature cofactor®*. The exact function of NifB in this 
process is unclear. Nevertheless, NifB is an indispensable constituent of 
FeMo-co biosynthesis, as deletion of nifB results in the generation of a 
cofactor-deficient MoFe-protein”’. Sequence analysis indicates that NifB 
contains a CKXXCXXC (where X is any amino acid) signature motif at 
the amino terminus, which is typical for a family of radical S-adenosyl- 
t-methionine (SAM)-dependent enzymes”. In addition, there is an 
abundance of potential ligands in the NifB sequence that are available 
to coordinate the entire complement of iron atoms of FeMo-co'. Thus, 
formation of the Fe-S core on NifB may represent a new synthetic route 
to bridged metal clusters that relies on radical chemistry at the SAM 
domain of NifB. For example, NifB could link two [4Fe-4S] subcubanes 
by inserting a sulphur atom along with the central atom, X, thereby 
building a fully complemented Fe-S core that could be rearranged later 
into the core structure of FeMo-co (Fig. 3). 


Insertion of molybdenum into the Fe-S core on NifEN 

The function of NifEN (NifE-NifN) as a scaffold protein for FeMo-co 
maturation was initially proposed on the basis of a significant degree of 
sequence homology between NifEN and the MoFe-protein, which has 
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led to the hypothesis that NifEN contains a ‘P-cluster site’ that houses 
a P-cluster homologue and an ‘FeMo-co site’ that hosts the conversion 
of FeMo-co precursor to a mature cofactor’*’. Whereas the P-cluster 
homologue in NifEN was identified earlier as a [4Fe—4S] cluster®, a 
molybdenum-free precursor of FeMo-co was captured on NifEN only 
recently”. Iron K-edge X-ray absorption spectroscopy reveals that 
this precursor closely resembles the Fe-S core of the mature FeMo-co 
despite slightly elongated interatomic distances” (Fig. 3). This finding 
implies that, instead of being assembled by the previously postulated 
mechanism that involves the coupling of [4Fe-3S] and [Mo-3Fe-3S] 
subclusters, the FeMo-co is assembled by having the complete Fe-S core 
structure in place before the insertion of molybdenum. 

The precursor on NifEN can be converted, in vitro, to a fully comple- 
mented FeMo-co on incubation with Fe-protein, MgATP, molybdate 
and homocitrate”’. Iron and molybdenum K-edge X-ray absorption 
spectroscopy reveals that the FeMo-co on NifEN is nearly identical in 
structure to the native cofactor in MoFe-protein, except for an asym- 
metric coordination of molybdenum that is probably due to the presence 
ofa different ligand environment at the molybdenum end of the cofac- 
tor in NifEN*'. Homocitrate is supplied by NifV (that is, homocitrate 


Figure 2 | Three-dimensional structures of representative members of 
molybdenum-containing enzymes. a, Molybdenum nitrogenase from 
Azotobacter vinelandii’; b, E. coli membrane-bound dissimilatory nitrate 
reductase A (NarG-NarH-Narl)"; ¢, chicken sulphite oxidase’’; d, bovine 
xanthine oxidase”. All enzymes shown operate as dimers of which one half 
is shown as a ribbon diagram with bound cofactors as spheres. For clarity, 
in the other half of the dimer, only the cofactors are shown, highlighting 
their spatial arrangement to maintain intramolecular electron transfer. 

The domain or subunit that binds the molybdenum-containing cofactor is 
rendered in orange. The FeMo-co-containing nitrogenase MoFe-protein 
assembles from two different subunits into a heterotetramer (a,$,). Subunit 
compositions and domain structures are depicted below (DD, dimerization 
domain; FAD, FAD domain; Fe, [2Fe-2S] cluster; H, haem domain; Moco, 
Moco-binding domain). A scale bar 100 residues long is provided beneath c 
to indicate protein size. Structures were rendered with PYMOL and coloured 
as follows: Mo, cyan; C, green; O, red; N, blue; S, yellow; Fe, brown. 
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synthase) in vivo”, but molybdenum mobilization within the cell that 
occurs before the intervention of Fe-protein remains a topic of debate**™*, 
Nevertheless, the fact that the cluster is completely converted before its 
exit from NifEN points to Fe-protein having a significant role in FeMo- 
co maturation. 

Fe-protein re-isolated after incubation with molybdate, homocitrate 
and MgATP is ‘loaded’ with molybdenum and homocitrate that can be 
subsequently inserted into the precursor on NifEN*. The molybdenum 
K-edge X-ray absorption spectrum of the loaded Fe-protein is consis- 
tent with a decreased number of Mo=O bonds (two or three instead 
of the four found in molybdate) as well as a decrease in the effective 
oxidation state of molybdenum due to either a change in the formal 
oxidation state of molybdenum or a change in molybdenum ligation. 
Interestingly, the electron paramagnetic resonance spectrum of loaded 
Fe-protein assumes a line shape intermediate between those of the 
MgADP- and MgATP-bound states of the Fe-protein®. This observa- 
tion is consistent with that from the initial crystallographic analysis of 
an ADP-bound form of Fe-protein, in which molybdenum is attached at 
a position that corresponds to the y-phosphate of ATP**. Such an ADP/ 
molybdenum-binding mode (Fig. 4a) may reflect the initial attachment 
of molybdenum to Fe-protein, particularly when the structural analogy 
between phosphate and molybdate is considered. Remarkably, similar 
nucleotide-assisted processes are proposed for the molybdenum inser- 
tion in pterin-based cofactors (see below; Fig. 4b). 


Insertion of FeMo-co into apo-MoFe-protein 

The completion of FeMo-co assembly on NifEN signals the deliv- 
ery of FeMo-co to its destined location in MoFe-protein. The abso- 
lute requirement of intermediary FeMo-co carrier(s) between NifEN 
and MoFe-protein was precluded by the observations of unaffected 
nitrogen-fixing activity of the host after deletions of proposed carrier- 
encoding gene(s)” and direct FeMo-co transfer between NifEN and 
MoFe-protein on protein-protein interactions*'. Sequence comparison 
between NifEN and MoFe-protein reveals that certain residues that 
either provide a covalent ligand or tightly pack FeMo-co within the 
polypeptide matrix of MoFe-protein are not duplicated in the corre- 
sponding NifEN sequence. It is possible, therefore, that the respective 
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cluster sites in NifEN and MoFe-protein are brought into close prox- 
imity, allowing the subsequent diffusion of FeMo-co from its biosyn- 
thetic site in NifEN (low-affinity site) to its binding site in MoFe-protein 
(high-affinity site). On its delivery to MoFe-protein, FeMo-co interacts 
with a number of MoFe-protein residues en route to its target location 
within the protein. Identification of these residues”? was assisted by 
the crystallographic analysis of a P-cluster-intact yet FeMo-co-deficient 
form of MoFe-protein, which contains a positively charged funnel in 
the a-subunit that is of sufficient size to accommodate the insertion of 
the negatively charged FeMo-co”. 


Biosynthesis of pterin-based molybdenum cofactors 
Although widespread in all kingdoms, Moco is synthesized by a con- 
served biosynthetic pathway divided into four steps according to 
the biosynthetic intermediates: cyclic pyranopterin monophosphate 
(cPMP), MPT and MPT-AMP. The biosynthetic pathway has been 
summarized in detail’ with particular focus on plants”’, bacteria” and 
humans”, and is believed to be very similar to W-co synthesis”. In 
prokaryotes a final modification by a nucleotide can occur, whereas in 
MPT-type enzymes Moco maturation either involves a terminal sulphu- 
ration (xanthine oxidase family) or cysteine ligation to the apoenzyme 
(sulphite oxidase family). 


Synthesis of the metal-binding pterin 

Biosynthesis starts with the conversion of GTP into cPMP (previously 
identified as precursor Z; ref. 54) catalysed by two proteins: a radical 
SAM enzyme (for example MoaA in bacteria) harbouring two oxy- 
gen-sensitive [4Fe-4S] clusters, and an accessory hexameric protein 
involved in pyrophosphate release (for example MoaC in bacteria)”. 
MoaA harbours an N-terminal Fe-S cluster involved in radical SAM 
generation and a MoaA-specific C-terminal Fe-S cluster crucial for 
substrate binding”. Although the reaction mechanism of cPMP syn- 
thesis is not yet fully understood, it is well established that each carbon 
of the ribose and purine is incorporated into cPMP*”*. Furthermore, 
the structure of cPMP as a fully reduced pyranopterin with a terminal 
cyclic phosphate and geminal diol (Fig. 5) supports its physicochemical 
properties”. With respect to the observed geminal diol, it remains to be 
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Figure 3 | Biosynthesis of FeMo-co. a, Sequence of events during 

FeMo-co assembly. The biosynthetic flow of FeMo-co is NifU-NifS 

~> NifB > NifEN > MoFe-protein. The combined action of NifU-NifS 
generates small Fe-S fragments on NifU (stages 1 and 2), which are used 
as building blocks for the formation of a large Fe-S core on NifB (stage 3). 
This Fe-S core is further processed into a molybdenum-free precursor 
(stage 4), which can be converted to a mature FeMo-co on NifEN on 
Fe-protein-mediated insertion of molybdenum and homocitrate (stage 5). 
After the completion of FeMo-co assembly on NifEN, FeMo-co is delivered 
to its destined location in MoFe-protein (stage 6). The permanent metal 
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centres of the scaffold proteins are coloured pink; the transient cluster 
intermediates are coloured yellow. HC, homocitrate. b, Structures of 
intermediates during FeMo-co assembly. Shown are the cluster types that 
have been identified (on NifU, NifEN and MoFe-protein) or proposed (for 
NifB). Hypothetically, NifB could bridge two [4Fe-4S] clusters by inserting 
a sulphur atom along with the central atom, X, thereby generating an Fe-S 
scaffold that could be rearranged into a precursor closely resembling the 
core structure of the mature FeMo-co. In the case of the NifEN-associated 
precursor, only the 8Fe model is shown. The potential presence of X in the 
intermediates of FeMo-co biosynthesis is indicated by a question mark. 
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Figure 4 | Proposed mechanisms for molybdate activation in FeMo-co and 
Moco biosynthesis. a, For FeMo-co synthesis, ina MgATP-dependent 
process Fe-protein (stage 1) reduces molybdenum from a more oxidized 
state, such as molybdate (Mo,,), to a more reduced state (Mo,,.4). M0,.4 
probably occupies the position of the y-phosphate of MgATP (stage 2), 
which is released on ATP hydrolysis. Subsequently, Mo,,,, in complex 

with homocitrate, can be inserted into the FeMo-co precursor, resulting 

in the formation of a mature FeMo-co on NifEN. b, In Moco biosynthesis, 
adenylylated MPT (MPT-AMP) and molybdate bind first in a cooperative 
manner to the Cnx1E domain (stage 1); subsequently, Zn”* or Mg”* 
promotes hydrolysis of the pyrophosphate bond in MPT-AMP. Stage 2 
depicts the formation of a hypothetical reaction intermediate (adenylylated 
molybdate), which is thought to represent an unstable transition state that 
will immediately react with MPT, thus replacing bound copper at the MPT 
dithiolate (stage 3). The function of copper is still unknown, and it remains 
unclear whether molybdenum insertion is dependent on copper. According 
to the coordination of adenylylated molybdate, as well as the required 
modification of Moco in molybdenum enzymes of the sulphite oxidase 

and xanthine oxidase families (cysteine binding or sulphuration), released 
Moco is proposed to carry three oxo ligands. 


determined at which point interconversion into a keto function takes 
place™*. The functions of MoaA and MoaC are conserved throughout 
evolution, as eukaryotic orthologues are able to restore Moco biosyn- 
thesis in bacteria”. 

To form the MPT dithiolate, two sulphur atoms are incorporated 
into cPMP by MPT synthase, a heterotetrameric complex of two small 
(MoaD in E. coli) and two large (MoaE in E. coli) subunits. MoaD car- 
ries a sulphur atom as thiocarboxylate at the conserved C-terminal 
double-glycine motif”, which is deeply buried in the large subunit to 
form the active site’. As one sulphur atom is bound per small subunit, 
a two-step mechanism for MPT dithiolate synthesis with the formation 
of a singly sulphurated intermediate has been demonstrated”. MPT 
synthase homologues in higher eukaryotes have been identified and 
characterized’. The expression of human MPT synthase is unusual, as 
both subunits are encoded by a bicistronic messenger RNA®. 

Ina separate reaction, sulphur is transferred to the small subunit of 
MPT synthase (Fig. 5). For this, in E. coli MoeB catalyses the adenylyla- 
tion of the C-terminal glycine residue of MoaD™ in a process that is 
notably similar to the action of the ubiquitin-activating enzyme Ubal®. 
Together with MoaD, which has a ubiquitin-like fold, MPT synthase 
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provides an evolutionary origin for ubiquitin-like protein conjugation. 
AMP-activated MoaD becomes sulphurated by sulphide transfer, which 
is catalysed by a cysteine desulfurase” and a rhodanese”; the latter is 
fused in eukaryotes, as the C-terminal domain, to an MoeB-homologous 
domain™. 


Metal insertion and nucleotide attachment 

On completion of MPT synthesis, the metal is transferred by a multistep 
reaction. Whereas E. coli encodes two separate proteins involved in this 
step, eukaryotes catalyse metal transfer by homologous two-domain 
proteins, such as Cnx1 (plants) and gephyrin (human) (Fig. 5), point- 
ing to a functional cooperation between their domains. The physio- 
logical functions of their domains were discovered by determining 
the crystal structure of the N-terminal G domain of Cnx1 in complex 
with substrate and product”. The latter was found to be MPT-AMP, 
a common intermediate in bacterial and eukaryotic Moco synthesis” 
synthesized by G domains and homologous proteins (MogA in bacte- 
ria)’'. Subsequently, a transfer of MPT-AMP to the E domain in Cnx1 
was demonstrated”. In the presence of divalent cations and molybdate, 
bound MPT-AMP is hydrolysed and molybdenum is transferred to the 
MPT dithiolate, resulting in Moco release. This Moco most probably 
carries two oxo ligands and one OH group depicted (Figs 4a and 5) in 
a deprotonated form”, as supported by preliminary spectroscopic data 
derived from a storage-protein-bound Moco (see below; G.S., unpub- 
lished observation). There is no experimental evidence for a reduction 
of molybdenum at this state. 

W-co biosynthesis is believed to be conserved up to MPT formation’, 
with differences in metal transfer. The tungsten-dependent archaeon 
Pyrococcus furiosus and related thermophiles lack mogA; instead, they all 
express genes encoding an MoaB-like protein, which also catalyses MPT 
adenylylation, confirming MPT-AMP as an essential and general pre- 
requisite before metal insertion”’. Furthermore, P. furiosus expresses two 
different MoeA-like proteins, suggesting metal-selective activities’. 

Finally, enzymes of the DMSOR family need to be further modified 
by the attachment of a nucleotide molecule (Fig. 5), a reaction depen- 
dent on the preceding metal insertion”. In E. coli, MobA catalyses the 
conversion of MPT and GTP to Mo-bis-MGD™. Interaction studies 
with proteins catalysing metal insertion and Mo-bis-MGD formation 
identified a transient Moco-synthesizing machinery comprising MogA, 
MoeA, MobA and molybdenum-enzyme-specific chaperones”. 


Cofactor maturation, storage and transfer 
Molybdenum hydroxylases such as aldehyde oxidase and xanthine 
oxidase require a final step of maturation to gain enzymatic activity, 
namely the addition of a terminal sulphido ligand to the molybde- 
num centre, which is catalysed by a Moco sulphurase (that is, Aba3 in 
plants or HMCS (also known as MOCOS) in humans), a two-domain 
protein” acting as a homodimer (Fig. 6). In a pyridoxal phosphate- 
dependent manner, the N-terminal NifS-like domain abstracts sulphur 
from L-cysteine and forms a persulphide intermediate on a conserved 
cysteine residue’’. Subsequently this sulphur is transferred via a sec- 
ond cysteine persulphide intermediate to bound Moco. Both of these 
steps are catalysed by the C-terminal Moco-binding domain of Aba3 
(ref. 78), which selectively stabilizes sulphurated Moco. The same 
mechanism operates in HMCS (R.R.M., unpublished observations). 
Among prokaryotes, no homologues to eukaryotic Moco sulphurases 
have been found. However, for xanthine dehydrogenase from R. cap- 
sulatus, its enzyme-specific chaperone XdhC was found to fulfil Moco 
sulphuration”. By contrast with enzymes of the xanthine oxidase fam- 
ily, sulphite oxidase and nitrate reductase incorporate Moco without 
further modification. The proposed tri-oxo coordination of molybde- 
num in mature Moco” (Figs 4b and 5) suggests a simple mechanism 
of cysteine ligation to the molybdenum accompanied by loss of one of 
the oxygens as water. 

As Moco is highly unstable once liberated from proteins, it was assumed. 
that Moco does not occur ina ‘free state’; rather, Moco might be bound toa 
carrier protein that protects and stores it until further use. Whereas some 
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Figure 5 | Biosynthesis of the pyranopterin-based molybdenum cofactors. Shown is a generalized 
scheme of the pathway based on data derived from studies in E. coli, plants and humans. All known/ 
characterized intermediates of the pathway are presented sequentially in the four steps in which Moco is 
synthesized. A fifth step present only in prokaryotes results in the formation of Mo-bis-MGD. Proposed 
(first step)” or partially characterized (second step) intermediates” are indicated in parentheses (circled 
P, phosphate group). The intermediate of the second step has been suggested on the basis of mechanistic 
studies of MPT synthase yielding an intermediate with a linear phosphate®. Proteins catalysing the 
individual steps are depicted in different colours, and a similar shade is applied to domains/proteins 
involved in nucleotide transfer. Homologous proteins from E. coli (Mo nomenclature), humans (MOCS’ 
nomenclature except gephyrin) and plants (‘Cnx’ nomenclature) are shown for comparison. The separate 
pathway regenerating MPT synthase is grouped within the grey box. For simplicity, only the MoeB 
protein is shown. We note that eukaryotes express fusion proteins that contain a MoeB domain and a 
C-terminal rhodanese-like domain involved in sulphur transfer to the small subunit of MPT synthase. 
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bacteria have molybdate-binding proteins such 
as Mop”, the alga Chlamydomonas reinhardtii 
produces a homotetrameric protein® that holds 
four Moco molecules in a surface-exposed bind- 
ing pocket. In higher plants, gene families with 
8-12 homologous Moco-binding proteins have 
been discovered recently (R.R.M., unpublished 
observations). It is still unclear whether these 
proteins represent a buffer in which to store 
Moco or whether they are part of the default 
pathway for Moco allocation and insertion into 
molybdenum enzymes, a mechanism poorly 
understood in eukaryotes. Because Moco is 
deeply buried within the holoenzymes, it needs 
to be incorporated before the completion of fold- 
ing and oligomerization of enzyme subunits/ 
domains; for this, many bacterial molybdenum 
enzymes require the presence of chaperones, 
such as Nar] for E. coli nitrate reductase, TorD 
for trimethylamine N-oxide reductase and 
DmsD for DMSOR, which bind and protect 
the apoenzymes, assist in cofactor insertion and 
control transmembrane targeting. 


Molybdenum homeostasis and disorders 
Cellular uptake 
Bacterial molybdenum uptake requires spe- 
cific systems to scavenge molybdate in the 
presence of competing anions. This involves 
a high-affinity ATP-binding cassette (ABC) 
transporter: molybdate is captured by one 
component, a periplasmic molybdate-binding 
protein (ModA), and transferred to another, 
the transmembrane channel (ModB). The 
crystal structure of an ABC transporter from 
Archaeoglobus fulgidus™ suggests a conserved 
two-state mechanism by which ATP hydrolysis 
and the release of ADP plus P, at the cytoplas- 
mic protein (ModC) controls conformation 
of the transmembrane protein, ModB. For 
tungstate, two ABC-type transporters, TupA- 
TupB-TupC and WtpA-WtpB-WtpC, have 
been identified”, the latter being highly selec- 
tive for tungstate over molybdate owing toa 
unique octahedral substrate coordination®. 
Algae and multicellular plants are the only 
eukaryotes for which the molybdate-uptake 
mechanisms have been recently determined. 
Two proteins belonging to the large sulphate- 
carrier family have been shown to transport 
molybdate with high affinity***. Unexpect- 
edly, none of them was found to reside in the 
plasma membrane. Contradictory reports 
localized them to the endomembrane system” 
or the mitochondrial envelope™. It is likely that 
additional transporters, not only in autotrophs 
but also in animals, will be discovered soon. 


Molybdenum-iron and -copper crosstalk 

Molybdenum metabolism is strictly depend- 
ent on iron metabolism at different levels. 
FeMo-co biosynthesis and nitrogenase matu- 
ration are based on the synthesis of complex 
Fe-S clusters, and enzymes participating in 
the first step of Moco biosynthesis contain two 
[4Fe—4S] clusters*. Furthermore, all molyb- 
denum hydroxylases and several members 


t-Cysteine L-Alanine 
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Figure 6 | Domain structure and function of Moco sulphurase Aba3 from 
Arabidopsis thaliana. The Aba3 protein and other homologues act as 
dimers catalysing the sulphuration of Moco, a reaction needed for the 
activation of molybdenum hydroxylases. Each Aba3 monomer can be 
divided into two domains, an N-terminal domain showing sequence 
homology to cysteine desulphurases (NifS-like enzymes) and a C-terminal 
domain that binds Moco. Recently, it was shown that a persulphide sulphur 
generated by the NifS-like domain is transferred to the C-terminal domain 
for the conversion of bound desulpho-Moco into sulphurated Moco, which 
is subsequently required to activate the target enzymes of Aba3 (that is, 
xanthine dehydrogenase and aldehyde oxidase). PLP, pyridoxal phosphate. 


of the DMSOR family use Fe-S clusters for intramolecular electron 
transfer. Finally, enzymes of the sulphite oxidase family contain haem 
cofactors. 

Recently, another link between the metabolic pathways of molybde- 
num and iron was discovered. In plants (and most probably also in ani- 
mals), enzymes catalysing cPMP synthesis, such as Cnx2 and Cnx3, were 
localized within the mitochondrial matrix (R.R.M., unpublished obser- 
vations), which necessitates the export of cPMP from mitochondria into 
the cytosol. Here, the mitochondrial ABC-type transporter Atm3 (also 
knownas Stal) from A. thaliana seems to fulfil a dual function: it not only 
exports Fe-S-cluster precursors to the cytosol”, but it is somehow also 
involved in cPMP translocation. Atm3-deficient plants showed defects 
in Fe-S-dependent cytosolic enzymes and accumulated large amounts 
of cPMP in mitochondria; consequently, activities of all molybdenum 
enzymes were strongly reduced (R.R.M., unpublished observations). 

Only a few cases and conditions of limited molybdate availability have 
been reported so far*””. Among these, the shortage of molybdenum in 
Australian farmland triggered excessive fertilization, resulting in molyb- 
denum overload of the soil that caused pathological symptoms of molyb- 
denosis in animals; this, in particular in ruminants, triggered secondary 
copper deficiency’. Later, these molybdenum-induced conditions of 
copper deficiency revealed the pathology of two copper-homeostasis 
disorders: Menkes disease (copper deficiency) and Wilson’s disease 
(copper overload)” (see page 823). Consequently, potent copper chela- 
tors such as tetrathiomolybdates were used to treat Wilson’s disease and 
a number of other disorders that are linked to copper homeostasis, such 
as neurodegeneration, cancer and inflammation”. 

Another antagonism between molybdenum and copper has been 
found recently. The crystal structure of Cnx1G, which catalyses MPT 
adenylylation, revealed the presence of a covalently bound copper ion 
(most probably Cu'*) at the MPT dithiolate in both the substrate- and 
product-bound states®. The function of copper during Moco biosyn- 
thesis is still unknown. It may participate in the sulphur-transfer reac- 
tion enabled by MPT synthase, act as a protecting group for MPT and/ 
or function within molybdenum insertion”. In vitro studies suggested 
a competition between copper and molybdenum during Moco forma- 
tion”, ultimately raising the question of whether Moco biosynthesis 
might be affected under conditions of copper overload or deficiency”. 


Molybdenum cofactor deficiency 
Human Moco deficiency (MoCD) results in the complete loss of sul- 
phite oxidase, xanthine oxidase and aldehyde oxidase activity. Patients 
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diagnosed with MoCD are classified into three groups according to 
the affected steps within the biosynthetic pathway (Fig. 7a). They are 
characterized by progressive neurological damage, leading to early 
childhood death in most cases”. Symptoms are mainly caused by the 
deficiency of sulphite oxidase protecting the organs (in particular the 
brain) from elevated concentrations of toxic sulphite. 

The function of sulphite oxidase is important, as it represents the last 
step in the oxidative degradation of sulphur-containing amino acids and 
lipids. Mainly in liver, sequential events in cysteine catabolism through 
cysteine sulphinate (cytosol) and 6-sulphinyl pyruvate (mitochondria) 
lead to the formation of sulphite (Fig. 7b). Sulphite oxidase, localized in 
the intermembrane space, oxidizes sulphite to sulphate. Under condi- 
tions of Moco or sulphite oxidase deficiency, sulphite accumulates in 
plasma and serum, crosses the blood-brain barrier and rapidly triggers 
neuronal death”. Impaired ATP synthesis has been suggested as one 
possible mechanism of sulphite toxicity”. Sulphite accumulation also 
triggers the reduction of cystine (Fig. 7b), the main carrier for cysteine 
in serum and plasma, which consequently causes the formation of S-sul- 
phocysteine™ (Fig. 7b), a potential agonist of glutamate receptors”*. The 
latter may explain the observed seizures, convulsions, contractions and 
twitching associated with MoCD, causing damage of cortical neurons 
as documented by abnormal magnetic resonance imaging of the brain 
and loss of white matter”. 

To study MoCD, an animal model with Mocs1 knockout has been con- 
structed”. Homozygous mice displayed a severe phenotype that reflects 
all biochemical characteristics of human Moco-deficient patients. 
They failed to thrive, and died within the first 12 days of life. The lethal 
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Figure 7 | Human Moco deficiency. a, Classification of Moco-deficient 
patients” according to the three distinguishable steps in human Moco 
biosynthesis. Genes encoding proteins catalysing the individual steps 

are shown. Type-A patients cannot form cPMP, whereas type-B patients 
accumulate cPMP, which is excreted in the urine. So far, only one type-C patient 
has been described, with a deletion of gephyrin due to an early stop codon in 
the gephyrin gene”, a protein also needed for the formation of inhibitory 
synapses. b, Degradation pathway of methionine and cysteine. Catabolic 
intermediates are depicted and metabolites that accumulate in Moco-deficient 
patients owing to increased sulphite concentrations are S-sulphocysteine, 
thiosulphate and taurine. Simultaneously, cystine concentrations are 

very low. S-Sulphocysteine is thought to make a major contribution to 
neurodegeneration because of its structural similarity to glutamate. We note 
that sulphite is formed from spontaneous decomposition of B-sulphinyl 
pyruvate. Sulphite crosses the mitochondrial inner membrane and is oxidized 
in the intermembrane space by molybdenum-dependent sulphite oxidase. 
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phenotype could be effectively neutralized by repeated injection 
of cPMP, which was purified from E. coli”*. Mice treated with cPMP 
developed normally, gained weight and reached adulthood and fertil- 
ity like their wild-type littermates. Notably, withdrawal of cPMP from 
Mocs1-knockout mice caused death within 10-14 days. These prom- 
ising results were verified by a recent clinical trial (G.S., unpublished 
observations). 


Conclusion 
The versatile redox chemistry of molybdenum is mirrored by the 
plethora and complexity of enzymes using molybdenum and, to some 
extent, tungsten. Nature has developed two very different systems to 
control the redox state and catalytic power of molybdenum. Although 
the contribution of each cofactor scaffold to the overall chemistry of a 
given enzyme requires further investigation, it is likely that the various 
scaffolds represent pathways generating sulphur-containing chelators 
that can trap, activate and control the transition elements as catalysts. 
Future research in the field of molybdenum enzymes is likely to focus 
on the mechanistic details of cofactor biosynthesis, cofactor allocation 
and the functions of cofactors in particular enzymes. Further under- 
standing of the differences between molybdenum and tungsten bio- 
chemistry will help to explain the unique presence of molybdenum in 
eukaryotes and to answer fundamental questions regarding functional 
specificity, metal selectivity, regulation, allocation, compartmentaliza- 
tion and assembly of this fascinating family of enzymes and cofactors. 
As seen for both FeMo-co and Moco biosynthesis, different assembly 
machineries can use similar mechanisms, such as the radical SAM-based 
chemistry and the nucleotide-assisted molybdenum activation, which 
are employed for the synthesis of both cofactors. The role of molybde- 
num in other homeostatic circuits, such as copper and iron metabolism, 
awaits further investigation, which could address questions regarding 
the pathophysiology of related metabolic disorders. Finally, the bio- 
synthetic machineries of these cofactors or the enzymes themselves 
(for example nitrogenase) are ancient natural inventions. Some of the 
proteins are precursors of those with specialized functions in eukary- 
otes, such as ubiquitin-like protein conjugation (MPT synthesis) and 
G-protein-based signalling’; others, such as gephyrin, which is crucial 
for synaptogenesis”, have gathered additional functions. a 
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Mechanistic considerations of 


halogenating enzymes 


Alison Butler’ & Moriah Sandy’ 


In nature, halogenation is a strategy used to increase the biological activity of secondary metabolites, 
compounds that are often effective as drugs. However, halides are not particularly reactive unless they 

are activated, typically by oxidation. The pace of discovery of new enzymes for halogenation is increasing, 
revealing new metalloenzymes, flavoenzymes, S-adenosyl-L-methionine (SAM)-dependent enzymes and 
others that catalyse halide oxidation using dioxygen, hydrogen peroxide and hydroperoxides, or that promote 


nucleophilic halide addition reactions. 


Halogenated natural products have been isolated in abundance from 
marine eukaryotic organisms, but also increasingly from microorgan- 
isms that inhabit marine and terrestrial environments (Fig. 1). These 
halogenated compounds range from peptides, polyketides, indoles, ter- 
penes, acetogenins and phenols to volatile halogenated hydrocarbons 
(for example bromoform, chloroform and dibromomethane) that are 
produced on a very large scale’*. Many halogenated marine metabo- 
lites possess biological activities of pharmacological interest, including 
anticancer, antifungal, antibacterial, antiviral and anti-inflammatory 
activities. The biogenesis of these compounds has intrigued scientists 
for decades. 

The first halogenating enzyme to be discovered, in the 1960s, was the 
haem (iron-containing porphyrin) chloroperoxidase (CPO) from the 
terrestrial fungus Caldariomyces fumago, which produces the chlori- 
nated natural product caldariomycin*. Some time later, haloperoxidase 
enzymes were proposed in the biogenesis of certain halogenated marine 
natural products”®, although this was long before haloperoxidases or other 
halogenating enzymes had been discovered in marine organisms. Since 
the discovery of haem CPO, other haem haloperoxidases’ ’, vanadium 
haloperoxidases'””” and non-haem iron halogenases"’, as well as non- 
metallo-, flavin-dependent halogenases“*, SAM-dependent chlorinases 
and fluorinases’’, and methyl halide transferases’*, have been discovered. 
For recent reviews, see refs 14, 15, 17-23. 

Haloperoxidases catalyse electrophilic halogenation reactions, whereas 
the non-haem iron halogenases catalyse radical halogenation reactions 
and SAM-dependent halogenases catalyse nucleophilic halogenation reac- 
tions. Haloperoxidases have traditionally been classified on the basis of 
the most electrophilic halide that is readily oxidized. Thus, chloroperoxi- 
dases oxidize chloride, bromide and iodide by hydrogen peroxide (H,O,), 
whereas iodoperoxidases oxidize only iodide in this way. Hydrogen perox- 
ide lacks the thermodynamic potential to oxidize fluoride; thus, enzymes 
catalysing fluorination are not peroxidases. The overall stoichiometry of 
the haloperoxidase reaction is consumption of one equivalent of H,O, per 
halogenated product produced (Fig. 2, reaction 1). 

One class of halogenase, the non-haem iron enzymes, uses dioxygen 
(O,) to activate halide (Fig. 2, reaction 2); these enzymes catalyse radical 
halogenation reactions at aliphatic carbon sites. The overall stoichiom- 
etry is consumption of one equivalent of a-ketoglutaric acid and O, per 
halogenated equivalent produced (Fig. 2, reaction 2). Flavin-dependent 
halogenases activate O, by means of formation of flavin-bound hydroper- 
oxide, which then oxidizes the halide (Fig. 2, reaction 3). This class of halo- 


genating enzyme is a two-component system in which a flavin reductase 
is required to keep the halogenase reduced for catalytic turnover. These 
enzymes catalyse reactions that more closely resemble the electrophilic 
haloperoxidases that are mediated by ‘HOX-like’ species. Finally, SAM- 
dependent enzymes catalyse nucleophilic substitution, generating 5’-halo- 
5’-deoxyadenosine and L-methionine (Fig. 2, reaction 4). In this review 
we focus on the mechanisms of metalloenzyme-catalysed halogenation 
reactions, emphasizing the activation of halides by O, and H,O,, and the 
nature of the halogenation step. 


OH 


Caldariomycin 


Prepacifenol 


Rees 
( eens 


H,N OH 


Syringomycin E 


Figure 1 | Halogenated natural products. A few examples of halogenated 
natural products that have been discovered in the past few decades. These 
natural products have been isolated from marine eukaryotic organisms, as 
well as microorganisms that inhabit marine and terrestrial environments. 
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Figure 2 | Biologically relevant halogenation xX +H,0,+R-H+H* —» R-X+2H,0 Reaction 1 
reactions. Four distinct classes of enzyme have 
been discovered that carry out halogenation O OH OH 
reactions. Reaction 1, which encompasses AAA +0,+X-+R-CH,+H* — R-CH)X+CO,+ HO we +H,O Reaction 2 
electrophilic halogenation, is catalysed by HO fe) 
haloperoxidases such as the haem-containing fe) ie) 
or vanadate-containing enzymes. Reaction 2 is Roy R 
carried out by a-ketoglutarate-dependent non- NV UNO NVZNV 0 
haem iron halogenases and is characteristic of CL | a O,+X°+R’H+H* —> R-X “CL ag + 2H,O0 Reaction 3 
radical halogenation at unactivated C-H centres. N ne Nid On 
Reaction 3 is catalysed by flavin-dependent H 69 fe) 
halogenases. Reaction 4 shows the SAM- ve ee 
dependent halogenase reaction. X, halogen atom; : 2 7 
R, R’, alkyl group. NH; (ss Sy * SN ‘i 
‘oc SS oN We oO) 7 : as v 
i) +X —> MY + OOC Ss Reaction 4 
HO OH HO OH 
Haem haloperoxidases the marine worm Notomastus lobatus’. This enzyme is unusual in that 


The fungus C. fumago, from which the first haloperoxidase, Fe(11)- 
haem CPO, was derived’, produces the chlorinated natural product 
caldariomycin, which is thought to be synthesized by CPO. The optimal 
pH for chlorination turnover by haem CPO is pH 2.7 (ref. 4). Interest- 
ingly, CPO is used to catalyse a number of oxidation reactions in the 
absence of chloride. These oxidation reactions are generally carried out 
at neutral pH, whereas chlorination occurs only at low pH. The active 
site of Fe(111)-haem CPO contains Fe(111)-protoporphyrin IX coordi- 
nated by the thiolate side chain of a cysteine in the proximal position”. 
Recently the X-ray structure of Fe(m1)-haem CPO was determined with 
the substrate, 1,3-cyclopentanedione (CPD), bound in one of the two 
channels leading to the haem iron centre (Fig. 3)”. In addition, halide- 
binding sites were identified at the protein surface near the opening 
of the second, narrower, channel leading to the haem iron moiety, as 
well as within this channel and at the other end of the channel, near the 
haem iron site”. 

A haem bromoperoxidase isolated from the marine green alga Peni- 
cillus capitatus contains a distal His ligand in place of the cysteine lig- 
and in Fe(m1)-haem CPO”. This enzyme uses bromide preferentially, 
although under certain conditions it can catalyse chlorination reactions’. 
Although the thiolate ligand was initially thought to be important for 
CPO activity, and His for bromoperoxidase activity in the haem haloper- 
oxidases, a haem CPO with a distal His ligand has been isolated from 


Val 67 


Figure 3 | Structure of the active site of haem CPO from C. fumago, witha 
bound substrate, 1,3-cyclopentanedione”. The active site of CPO contains 
Fe(111)-protoporphyrin IX coordinated by the thiolate side chain of a Cys 

in the proximal position. Two channels have been identified that lead to 

the haem active site. The substrate, 1,3-cyclopentanedione (CPD), has 

been located in the broader of the two channels in a recent X-ray structure, 
and halides were located within the narrower channel”. Glu 183 is located 
close to the haem iron centre and is proposed to assist in formation of both 
compound 0 and compound I (Fig. 4). The carbon backbone is shown in teal 
green, oxygen in red, nitrogen in blue, sulphur in yellow and the Fe(111) centre 
in orange; the dashed lines indicate hydrogen bonding. 


it requires a partner flavoprotein for activity. 

The haem iron centre functions as a redox catalyst (Fig. 4). Hydrogen 
peroxide oxidizes the haem Fe(111) centre to compound I, the Fe(1v)-oxo 
m-cation radical species (O=Fe(Iv)-haem”’), via the short-lived com- 
pound-0 state, characterized as a peroxo-anion complex, HOO-Fe(111)- 
haem”*. Glu 183 is proposed to assist in formation of both compound 0 
and compound I. Compound I oxidizes chloride by two electrons, 
reforming the haem Fe(11) centre. Characterization of the state that is 
generated immediately following chloride oxidation has generated much 
discussion”. At issue is whether OCI is a ligand to the Fe(111)-haem 
at the distal site, whether it is released from the enzyme active site or 
whether it is otherwise trapped within the enzyme substrate pocket but 
not coordinated to Fe(111)-haem. The oxidized chlorine intermediate can 
then chlorinate the organic substrate or react with a second equivalent 
of H,O,, producing O, (in the singlet excited state). Although caldari- 
omycin (Fig. 1) is not a chiral compound, interest in stereospecific and 
regiospecific halogenation reactions has been widespread. 


fe) 
Glu1s3 0 
R-Cl + H,O : 
SS 
[RH] — Pane. 
02+ Cr Fe(ill)- a HPO 
HO, 
ie Glu a 5 
SSN rE 7 > i, 
aoaccad 0 
O +H,0 
ele 
Cl ae Sas 
ns 
Compound | 


Figure 4 | Proposed catalytic cycle for haem CPOs”. The Fe(111)-haem 
resting state is oxidized by H,O,, forming compound I. Compound I 
oxidizes chloride by two electrons, reforming the haem Fe(11) centre 

and generating an oxidized chlorine intermediate that is formally at the 
oxidation level of OCI. This oxidized chlorine intermediate could chlorinate 
the organic substrate (shown in red) or oxidize a second equivalent of H,O, 
(shown in blue), depending on the reaction conditions. L, cysteine. 
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Phe 492 


Figure 5 | Structure of the active site of V-CPO from C. inaequalis®. The 
V(v)-containing active site shown here for V-CPO is very similar in all 

the V-HPOs. A conserved His residue is a ligand to vanadate. The carbon 
backbone is shown in teal green, oxygen in red, nitrogen in blue and the V(v) 
centre in grey; the dashed lines indicate hydrogen bonding. 


Numerous kinetic and spectroscopic investigations along with product 
analyses have tried to probe the nature of the halogenating intermediate. 
Several experimental results suggest that the oxidized chlorine interme- 
diate may not escape from the active site during turnover”. A key result 
arising from Cl oxidization by compound I, which is the rate-limiting 
step during turnover, is the *Cl/*’Cl isotope effect. During chlorination of 
1,3,5-trimethoxybenzene or 3,5-dimethylphenol in the presence of excess 
chloride (with a known chlorine isotopic composition, —0.76 + 0.07%o), 
*C] is preferentially incorporated into the chlorinated organic products. 
The kinetic isotope effect, k,-/k,, (the ratio of isotope reaction rate con- 
stants), of 1.012 for 1,3,5-trimethoxybenzene or 1.011 for 3,5-dimeth- 
ylphenol is significantly more than that observed in the uncatalysed, 
molecular chlorination reaction with hypochlorite under otherwise iden- 
tical conditions”. The observed chloride isotope fractionation establishes 
that the oxidized chlorine intermediate is not in exchange equilibration 
with bulk CI, Cl,, OCI, HOCI or other possible chlorine species outside 
the enzyme active site. Thus, this experiment suggests that the oxidized 
chlorine intermediate is trapped in the enzyme active-site pocket and that 
chlorination occurs within the enzyme active-site channel. 


Vanadium haloperoxidases 
The first vanadium haloperoxidase (V-HPO) was discovered in marine 
kelp in the early 1980s'""". Since that time, vanadium bromoperoxidases 


a His 404 
R-X + H,O we 
HO 
| 
[RH] o-vee a 
| oO 
0+ L 
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me ———_ 0726 
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Figure 6 | Mechanistic considerations of V-HPOs. a, Proposed catalytic 
cycle for V-HPOs’*!”*?”, The catalytic reaction is initiated by coordination 
of one equivalent of H,O, to the resting V(v) state of the enzyme. 
Hydrogen bonding from a Lys side chain to V(v)-coordinated peroxide 
most probably assists halide oxidation by increasing the oxidation 
potential of the oxo-peroxo-V(v) moiety. After halide oxidation, the 
oxidized halogen intermediate, ‘OX”, could brominate the organic 
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(V-BPOs) have been isolated from the three major classes of marine sea- 
weed, and vanadium CPOs (V-CPOs) have been isolated from fungi” and 
are predicted to be present in marine bacteria’. V-HPOs are undoubtedly 
important in the biosynthesis of many halogenated marine natural prod- 
ucts (see below), and have recently been shown to be involved in iodide 
accumulation in marine macroalgae (kelp)”’. Although chlorinated natu- 
ral products have not been isolated from the fungi containing V-CPO, 
these enzymes may be important in fungus-mediated chlorination and 
degradation of lignin”. 

The vanadate site in V-HPOs is positioned at the bottom of a channel 
15-20 A deep, at the core of the four-helix bundle scaffold (Fig. 5)». 
Vanadium is coordinated to the protein by a single axial His ligand in a 
trigonal bipyramidal coordination geometry. The three equatorial oxygens 
of vanadate are hydrogen-bonded to two conserved Arg residues, a Lys, a 
Ser and the backbone ofa Gly amide, thereby decreasing the overall nega- 
tive charge around the vanadate ion. The proposed apical water/hydrox- 
ide ligand is hydrogen-bonded to a His. The X-ray structures of V-BPO 
(Corallina officinalis™, Ascophyllum nodosum”) and V-CPO (Curvularia 
inaequalis’) are remarkably similar, and are nearly superimposable when 
the structures are overlaid at the V(v) centre”. The striking similarity of 
the active sites raises questions about the molecular basis of the halide 
selectivity and the reactivity towards halide oxidation. One noticeable 
difference in the overlay is the presence of Phe 397 in V-CPO but His in 
V-BPO. This extra His residue does not interact directly with vanadate, 
although it has been proposed that the His side chain participates as a 
proton donor and acceptor during catalysis””**’ and, thus, could affect 
the overall oxidation potential of the oxo-peroxo- V(v) moiety. 

In contrast to the haem haloperoxidases that function as redox catalysts, 
V-HPOs function as Lewis acid catalysts of halide oxidation by H,O,. The 
catalytic reaction is initiated by coordination of one equivalent of H,O, 
to the resting V(v) state of the enzyme (Fig. 6a). The X-ray structure of 
the peroxo adduct of V-CPO reveals that a Lys side chain is hydrogen- 
bonded to the coordinated peroxide. This is probably an essential feature 
of the catalytic reaction because it would increase the potential of the oxo- 
peroxo-V(v) centre for halide oxidation. The oxo-peroxo-V(v) species 
can then oxidize the halide by two electrons, forming an oxidized halogen 
that is formally at the X* oxidation state or, more probably, at that of the 
hypohalite anion, OX’. Electrophilic halogenation results from reaction of 
‘OX’ either with the organic substrate or, in the absence ofa good organic 
substrate, with a second equivalent of H,O,, forming O, (which has been 
shown to be in the singlet excited state during turnover with Br )** and 
the halide, X-. Radical halogenation reactions have not been observed 
with the V-HPO enzymes. 
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substrate (shown in red) or oxidize a second equivalent of H,O,, forming 
singlet oxygen (shown in blue). b, V-BPO-catalysed formation of a-, B- 
and y-snyderol and (+)-36-bromo-8-epicaparrapi oxide natural products 
from (E)-(+)-nerolidol”’. The formation of single diastereomers of a-, 

B- and y-snyderol suggests that the reaction proceeds through a common 
intermediate, initially formed by electrophilic bromination of a single face 
of the terminal alkene site in (E)-(+)-nerolidol. L, ligand. 
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Initial reports suggested that V-BPO lacked organic-substrate specificity 
because a wide range of brominated organic compounds could be pro- 
duced. However, few compounds are brominated as efficiently as would 
be indicated by the stoichiometric consumption of one equivalent of H,O, 
per brominated organic equivalent produced (Fig. 2, reaction 1). Instead, 
during halogenation reactions of many compounds, far more H,O, is 
consumed in the halide-assisted H,O,-disproportionation reaction than 
brominated product is formed, consistent with escape of HOBr from the 
active site as occurs with poor organic substrates. 

Terpenes, which are the biogenic precursors to many halogenated 
terpene natural products, are substrates for V-BPO that do react stoi- 
chiometrically during turnover, such that one equivalent of H,O, is 
consumed per equivalent of terpene reacted, at least with the red algal 
V-BPOs*". For example, V-BPO catalyses the bromination of the ses- 
quiterpene (E)-(+)-nerolidol, which is accompanied by terpene cycliza- 
tion, and consumption of one equivalent of H,O, in the formation of a-, 
B- and y-snyderol and (+)-3B-bromo-8-epicaparrapi oxide (Fig. 6b)”. 
Single diastereomers of B- and y-snyderol and mixed diastereomers of 
(+)-3B-bromo-8-epicaparrapi oxide (diastereomeric excess, 20-25%) 
are produced in the V-BPO-catalysed reaction with (£)-(+)-nerolidol, 
whereas two diastereomers of these compounds are formed in the syn- 
thesis with 2,4,4,6-tetrabromocyclohexa-2,5-dienone”. 

The formation of single diastereomers of the snyderols in the V-BPO 
reaction indicates that only one face of the terminal alkene of (E)-(+)- 
nerolidol is brominated. Moreover, the halogenating species is trapped 
within the active site, although speciation within the active site, such as 
HOBr/OBr /Br,, V(v)-OBr ora possible bromamine or acyl hypobromite 
species, among other possibilities, cannot be addressed explicitly. Thus, the 
active-site channel would seem to playa role by holding the terpene sub- 
strate in place such that only one face of the terminal alkene is presented 
to the vanadium centre. Differences between the active-site channels, in 
terms of terpene-substrate selectivity, is a topic of current interest. 

Recent cluster analysis of the napyradiomycin biosynthetic gene cluster 
(nap) from the marine sediment Streptomyces sp. strain CNQ-525 and the 
actinomycete Streptomyces aculeolatus NRRL 18442 revealed multiple 
open-reading- frame sequences that could encode multiple V-CPOs and 
a flavin-dependent halogenase. Functional expression of the biosynthetic 
cluster leads to formation of chlorinated compounds (Fig. 7); however, a 
functional V-CPO has not yet been isolated”. 

Phylogenetic analysis of V-HPOs shows that the nap V-CPOs forma 
clade distinct from the marine algal V-BPO clade (A. nodosum, Corallina 
pilulifera and C. officinalis) and the fungal V-CPO clade (C. inaequalis 
and Embellisia didymospora)*. All of the V-HPOs havea set of conserved 
residues that comprise the His ligand to vanadate (His 496 in the number 
sequence of C. inaequalis V-CPO) and five of the other amino acids at the 
active site, most of which are involved in hydrogen-bonding to vanadate 
oxygen atoms. However, His 404, a residue that is found within hydrogen- 
bonding distance of the vanadate apical oxygen, is absent in the putative 


Figure 8 | Bromination of 3-oxo-hexanoyl 
homoserine lactone and subsequent hydrolysis of 
the dibromo product®. Many bacteria secrete acyl 
homoserine lactone compounds, which they use 

as signalling molecules to sense cell density. The 
3-oxo class of acyl homoserine lactone is readily 
susceptible to electrophilic halogenation at C2 with 
aqueous bromine solutions containing HOBr and 
Br,, as shown in this sequence of reactions. 
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Figure 7 | Proposed biosynthesis of the 
napyradiomycins A80915A-A80915D™. The 
sequence of reactions depicts chlorination and 
cyclization of the isoprene and the geranyl units 
in a manner analogous to V-BPO-catalysed 
bromination and cyclization of terpenes shown 
in Fig. 6b. 


nap V-CPOs”™. It will be interesting to learn the effects that the His 404 
substitutions (Ser, Phe) have on halide oxidation reactivity. 

Inan interesting display of chemical defence, some marine algae have 
evolved a means to disrupt bacterial quorum sensing and, thus, mitigate 
the effects of bacterial colonization of an alga’s surface. Quorum sensing 
is the mechanism by which bacteria sense cell density and regulate such 
phenotypic responses as biofilm formation, virulence, swarming motility 
and bioluminescence. Many bacteria use acyl homoserine lactones as their 
signalling molecules to activate a receptor-mediated cascade of events 
dependent on sufficient cell density”. 

The lack of microbial fouling of the marine red alga Delisea pulchra 
has intrigued marine ecologists for some time”. Delisea pulchra produces 
halogenated furanones that have been shown to inhibit quorum sens- 
ing by interfering with the receptor-mediated phenotypic responses of 
certain bacteria”. Intriguingly, the 3-oxo-acyl homoserine lactones are 
readily susceptible to electrophilic halogenation at C2, as demonstrated 
by reaction with aqueous HOX/OX /X, as well as by the V-HPO of Lami- 
naria digitata (Fig. 8)". The brominated products disrupt quorum sensing 
efficiently, as demonstrated with engineered reporter strains of Agrobac- 
terium tumefaciens and Chromobacterium violacium™. Some marine algae 
have haloperoxidases on their surfaces, suggesting that they use a double- 
pronged approach of attacking 3-oxo-acyl homoserine lactones secreted 
by bacteria attempting to colonize their surface, in addition to producing 
halogenated compounds to disrupt quorum sensing. 


Non-haenm iron halogenases 

A number of halogenated natural products, such as syringomycin E, 
barbamide and jamaicamide, contain halogen substituents at an unac- 
tivated carbon centre’*'**””. The biosynthesis of these compounds is 
catalysed by a new class of a-ketoglutarate (aKG)-dependent, non-haem 
iron (Fe,,,) halogenase enzymes that comprise a divergent subset of 
the larger class of Fey,;-aKG oxygenases. In the oxygenase family of 
enzymes, the bidentate complex of aKG coordinated to Fe(11) is bound 
in the enzyme active site by coordination to a conserved set of two His 
ligands and either an Asp or a Glu ligand that form a facial triad at the 
octahedral iron site. The pentadentate ligation leaves a vacant coordi- 
nation site for O, coordination. Dioxygen activation is coupled to the 
oxidative decarboxylation of aKG, a co-substrate in the overall enzyme 
reaction and a source of two of the four electrons required to reduce O, 
to H,O (Fig. 2, reaction 2). 

A distinguishing characteristic of the Fey,-aKG halogenases is the 
absence of the carboxylate protein ligand (that is, Glu or Asp). As shown 
in the X-ray structure of SyrB2, which was the first Fey), halogenase to 
be discovered (Fig. 9)*°, Fe(11) is coordinated by the two conserved His 
ligands and a halide ligand. The conserved Glu/Asp residue has been 
replaced with Ala; this means that the (His),(Asp/Glu) triad of the oxy- 
genases has been changed to (His),(Ala), thus creating a vacancy for 
the halide. 
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His 116 His 235 

Figure 9 | Structure of the active site of Fe,,,-aKG in SyrB2 from 

P. syringae*®. The active site comprises Fe(11) coordinated to two conserved 
His ligands and a halide ligand. A conserved Glu/Asp residue found in 
Feyy-aKG oxygenases has been replaced by Ala, creating a vacancy for the 
halide coordination. The carbon backbone is shown in teal green, oxygen in 
red, nitrogen in blue and the chloride ligand to the non-haem Fe(11) centre 
(orange) in bright green; the dashed lines indicate hydrogen bonding. 


The catalytic cycle of the halogenases (Fig. 10) resembles the reaction 
sequence of the Fey;;-aKG oxygenases up to the point of formation of 
the oxo-ferryl species**’’. The halogenation reaction begins with asso- 
ciation of the organic substrate in the enzyme active site, which triggers 
a change in the resting state enabling the subsequent coordination of 
O, by Fe(11) in (His),(X )Fe(11) yy-aKG (Fig. 10)*”. On the basis of the 
‘HAG’ mechanism of the Fe,,,-aKG oxygenases”*, Matthews et al.” have 
proposed a series of states (I, II and III; Fig. 10) that lead to hydrogen- 
atom abstraction by the haloferryl (X—-Fe(1v)=O) species. The oxygen- 
ase and halogenase mechanisms diverge at this point. The halogenase 
mechanism proceeds by a rebound attack by the halogen atom at the 
iron centre (Fe(111)—X) on the organic radical leading to the halogenated 
organic compound and Fe(11), as opposed to rebound by the hydroxyl 
radical (Fe(111)-OH) in the oxygenase mechanism. The hydrogen-atom 
abstraction step shows a large deuterium kinetic isotope effect, which 
has also been observed with other aKG-dependent hydroxylases”. 


The ‘substrate triggering’ mechanism in the Fey,,-aKG halogenases 
(Fig. 10) proposed by Matthews et al.”’ is consistent with the biosyn- 
thesis of syringomycin E (Fig. 1), in which L-threonine is chlorinated 
by the Fey,,-aKG halogenase component, SyrB2, of the non-ribosomal 
peptide synthetase biosynthetic gene cluster in Pseudomonas syringae 
B301D (Fig. 11). A number of other Fey, halogenases have also been 
identified or predicted”, including BarB1 and BarB2 from Lyngbya 
majuscula, which are involved in the trichlorination of L-leucine of 
barbamide”; CytC3 enzymes involved in the sequential chlorination of 
L-aminobutyrate”’; and even CmaB, which chlorinates the y position of 
t-allo-isoleucine (Fig. 11), among others™”’. In this last reaction, y-Cl- 
t-allo-isoleucine undergoes a y-elimination reaction catalysed by the 
enzyme CmaC, leading to cyclopropyl ring formation and the release of 
chloride, a process that has been termed cryptic chlorination”. 

PCR cloning of halogenase homologues has been useful in the discovery 
of new Fey;,;-aKG halogenases™. Of particular interest are the Fey,,-aKG 
halogenase components, CurA and JamE, of L. majuscula, which were 
predicted to be involved in the biosyntheses of curacin A through cryptic 
halogenation” and jamaicamide A through vinyl chloride formation”. 
Very recently, Gu et al.** demonstrated the remarkable sequence identity of 
CurA and JamE, at 92%, yet curacin A and jamaicamide A are structurally 
very different. In the Cur pathway a B-branched cyclopropane is formed, 
whereas in the Jam pathway a vinyl chloride is formed. The evolution of 
these metabolic pathways within the same organism is striking. Chlorina- 
tion occurs at a carboxylated y-carbon in both pathways, a reaction not 
previously observed in other halogenases. As depicted in Fig. 12, the bio- 
synthetic pathways diverge after chlorination at the decarboxylation step. 
After formation of a,8-enoy] thioester (I> I in Fig. 12) in the Cur path- 
way, the NADPH-dependent Cur enoyl reductase domain was found to 
catalyse formation of the cyclopropyl unit, presumably by intramolecular 
nucleophilic displacement of Cl. This reaction is new and distinct from 
the cryptic chlorination reaction catalysed by CmaB (Fig. 11d). However, 
in the Jam pathway a B,y-enoyl thioester of the 3-methyl-4-chlorogluta- 
conyl decarboxylation product is formed (I > II] in Fig. 12) that is not a 
substrate for the Jam enoyl reductase. The Jam enoyl reductase domain is 
a reductase for the a,}-enoyl thioester solely, and thus is not functional in 
the biosynthesis of jamaicamide. 
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Figure 10 | Proposed catalytic cycle of Fe,,,-aKG halogenases. The reaction 
is triggered on binding of the organic substrate, inducing a conformational 
change such that the resting Fe(11),,;-aKG state can then react with O,. The 


852 


scheme shows a series of states — I, II, and III — that lead to hydrogen- 
atom abstraction by the haloferryl (X—Fe(1v)=O) species. The halogenation 
reaction can then proceed as described in the main text. Adapted from ref. 47. 
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Non-metallo-haloperoxidases 

Not all biological halogenation is catalysed by metalloenzymes. A large 
class consists of the flavin-dependent halogenases (Fig. 2, reaction 2; 
for a current review, see ref. 18). These enzymes carry out halogenation 
reactions, such as in the biosynthesis of rebeccamycin. Halogenation by 
perhydrolase enzymes has also been demonstrated, although the biologi- 
cal relevance of this enzyme activity is not known”. 

A significant class of non-metallo-halogenating enzymes use SAM 
as a co-substrate. The first of the SAM-dependent halogenases to be 
discovered was the methyl chloride transferase enzyme in seaweeds 
and other plants, which is responsible for the production of profuse 
levels of methyl chloride’, raising questions about its biological 
function. Other SAM-dependent halogenases, including chlorinase”® 
and fluorinase’*”*” enzymes, are now known. These enzymes cata- 
lyse nucleophilic halogenation reactions, rather than electrophilic or 
radical halogenation reactions. The SAM-dependent chlorinase SalL, 
from the marine bacterium Salinospora tropica, chlorinates SAM by 
nucleophilic chloride addition generating 5’-chloro-5’-deoxyadenosine 
as a precursor to chloroethylmalonyl-CoA, which is ultimately incor- 
porated into salinosporamide A”. SalL will also accept Br and I" but 
not F directly. The fluorinase is the first step in the biosynthesis of 
fluoroacetate and 4-fluorothreonine from the soil bacterium Strepto- 
myces cattleya at the C5’ site of SAM in a S,2 reaction that occurs with 
inversion of configuration” 


Outlook 

In the past five years, more halogenating enzymes have been identified 
than in the four decades after the discovery of the first halogenating 
enzyme, haem CPO (C. fumago). The pace of discovery of new haloper- 
oxidases and halogenases is bound to continue to increase with the use of 
genomic and bioinformatic approaches. We will undoubtedly continue to 
isolate new halogenated natural products and uncover new biosynthetic 
pathways that co-opt halogenating enzymes for new purposes. 

However, in the four-and-a-half decades since the discovery of haem 
CPO, fundamental questions remain about the mechanism of the halo- 
genating enzymes. At first glance, the intricate balance between halo- 
genation and hydroxylation in the Fey,,-aKG enzymes would seem to 
be mediated by the replacement of a carboxylic-acid-containing amino 
acid, Glu or Asp (with Ala in the halogenases), creating a vacant coordi- 
nation site for halide coordination to iron. But relatively few structures 
are known for the Fey,,-aKG halogenases. It is not yet known how the 
Fey;-aKG active sites vary in this class of enzyme or what the elements 
of discrimination and selectivity are that exist within these enzyme active 
sites. The 45-year-old question regarding the nature of the halogenating 
moiety in the haloperoxidases remains unanswered. 

New spectroscopic techniques able to provide faster snapshots during 
turnover may be revealing. With significant advances in the sensitivity 
of isotope mass spectrometers, investigations of halogen isotope effects 
may have a more prominent role in mechanistic investigations, particu- 
larly now for ™Br/*'Br fractionation. Within the V-HPOs, elucidating 
the factors that control halide selection and positioning, as well as the 
organic substrate selectivity, will be particularly interesting. The struc- 
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Figure 11 | Examples of Fe,,,-aKG halogenase reactions. Fe,,,-aKG 
halogenases have been identified in the biosynthesis of a number of 

natural products, such as syringomycin E (a; Fig. 1), barbamide (b; Fig. 1), 
armentomycin (¢c) and coronamic acid (d). The triple arrows mean that 
multiple reaction steps and different enzymes are required for the indicated 
conversions. NRPS, non-ribosomal peptide synthetase. 


tures of the V-HPOs are fully superimposable at the V(v) centre, yet 
we know relatively little about the role that the active-site channel has 
in organic-substrate selectivity, and why some halogenated products 
predominate in one alga but not another related alga with a very similar 
haloperoxidase. In consideration of cryptic halogenation, it is not known 
what other natural products are formed via halogenated intermediates, 
including possibly in polymerization reactions. 

The parallel between haem, non-haem iron and flavin-dependent 
halogenase enzymes and haem, non-haem iron and flavin-dependent 
oxygenase enzymes is striking. In addition to these three types of oxy- 
genase enzyme, other oxygenases are also well known, including copper 
and manganese enzymes. Other metalloenzymes may be involved in 
halogenation reactions. In retrospect, it is not surprising that marine 
organisms evolved V-HPOs, given the halide content of the oceans and 
the fact that vanadium is the second most abundant transition-metal ion 
in seawater. Organisms living in other salty environments with unique 
transition-metal-ion compositions also may have evolved new metallo- 
enzymes that carry out halogenation reactions. a 


Figure 12 | Parallel assemblies in 

the biosynthesis of curacin A and 

O jamaicamide A. The pathways diverge after 
chlorination. ERase, enoyl reductase. The 
blue dot shows the B-branching carbon atom. 
Adapted from ref. 54. In the Cur pathway, 
compound I is converted to an a-f-enoyl 
thioester (II), which is a substrate for the Cur 
ERase. In the Jam pathway, compound I is 

\ converted to a B-y-enoyl thioester (III), which 
is not a substrate for the Jam ERase. 
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Design of functional metalloproteins 


Yi Lu’, Natasha Yeung’, Nathan Sieracki' & Nicholas M. Marshall’ 


Metalloproteins catalyse some of the most complex and important processes in nature, such as 
photosynthesis and water oxidation. An ultimate test of our knowledge of how metalloproteins work is to 
design new metalloproteins. Doing so not only can reveal hidden structural features that may be missing 
from studies of native metalloproteins and their variants, but also can result in new metalloenzymes 

for biotechnological and pharmaceutical applications. Although it is much more challenging to design 
metalloproteins than non-metalloproteins, much progress has been made in this area, particularly in 
functional design, owing to recent advances in areas such as computational and structural biology. 


Metalloproteins account for nearly half of all proteins in nature. Protein 
metal-binding sites are responsible for catalysing important biological 
processes, such as photosynthesis, respiration, water oxidation, molecu- 
lar oxygen reduction and nitrogen fixation. Much effort has been devoted 
to understanding the structure and function of metalloproteins, as sum- 
marized by other reviews in this Insight. The ultimate test is to use this 
knowledge to design new metalloproteins that reproduce the structures 
and functions of native metalloproteins’’. Metalloprotein design is not 
just an intellectual exercise that duplicates biochemical and biophysical 
studies of native metalloproteins. This ‘bottom-up approach can also eluci- 
date structural features that may remain hidden in those studies. Whereas 
biochemical and biophysical studies mostly reveal individual features that 
result in a loss of function, design requires the incorporation of all the 
structural features needed to attain a function. Equipped with insights 
from design processes, it may be possible to design new metalloproteins 
with improved properties such as higher stability and greater efficiency, 
or to impart them with functions not found in nature, for use in an even 
wider range of biotechnological and pharmaceutical applications. 

Metalloprotein design, however, has proven to be more challenging than 
the design of non-metalloproteins. Both fields require the issue of polypep- 
tide design to be addressed, but metalloprotein design involves metal ions, 
which are much larger in number than amino acids, and geometries that 
are much more variable. Conversely, most metal-binding sites are highly 
chromatic and display distinctive magnetic properties, making it easier 
to characterize the designed metalloprotein using metal-based spectro- 
scopic techniques rather than X-ray crystallography or nuclear magnetic 
resonance (NMR), thus shortening design cycles. Therefore, the field 
of metalloprotein design has enjoyed much success recently, owing to 
advances in biophysical, computational and structural biology. The field 
is also progressing from purely structural to more functional designs, and 
from the design of the primary coordination sphere to that of the second- 
ary coordination sphere and beyond. We summarize these advances here, 
with a focus on creating new and functional metal-binding sites in either 
de novo-designed or native protein scaffolds. In addition, the emerging 
area of designing novel metalloproteins with functions enhanced beyond 
those available in nature, through the introduction of unnatural amino 
acids or non-native metal cofactors, is covered. We also summarize the 
methodologies used in achieving these goals. 


Metalloprotein design in de novo-designed scaffolds 
The purest and most challenging form of metalloprotein design is de novo 
design, or design ‘from scratch. De novo design of a metalloprotein 


involves constructing a polypeptide sequence that is not directly related 
to any natural protein and that folds precisely into a defined three- 
dimensional structure that binds a metal ion’. Most work in this area 
has focused on introducing metal-binding sites into designed a-helical 
bundles (Fig. 1), which are among the first de novo-designed proteins’. 
These a-helical bundles are a common scaffold for a number of haem 
proteins in nature. Therefore, not surprisingly, haem centres were among 
the first, and are some of the most common, metal centres incorporated 
into de novo a-helical bundles’, from mono-haem binding? (Fig. 1a) 
to multi-haem maquettes’ (Fig. 1b), mimicking long-range electron- 
transfer proteins. These proteins were rationally designed by direct com- 
parison with native proteins, such as cytochrome bc,, for clues to the 
placement of essential haem-binding and structure-stabilizing residues. 
In addition to electron-transfer* and proton-coupled exchange” func- 
tions, a functional aniline hydroxylase”, peroxidase’ and oxygenase” 
have also been engineered into de novo-designed proteins. An exciting 
development in this field is the design of a di-haem-containing four- 
helix bundle with oxygen transport properties, whose O, affinities and 
exchange timescales match those of natural globins'’. Remarkably, this 
designed protein binds O, even tighter than carbon monoxide, a desir- 
able property that most native O,-binding haem proteins do not have. 
A key to its success is a reduction of helical-interface mobility to exclude 
water from the haem-binding site, thereby reducing haem oxidation and 
increasing O,-binding stability. 

In addition to haem, other metal ions/cofactors have been engi- 
neered into helical bundles by introducing metal-binding ligands at 
specific locations to mimic those in native proteins. Examples are the 
Cys,His, ligand set found in zinc-finger proteins” and the His, set in 
carbonic anhydrase’’. The X-ray crystal structure of an arsenic-bound 
three-stranded coiled-coil protein (Fig. 1c), which resembles the puta- 
tive tris-Cys metal-binding environment of the natural arsenic-binding 
protein, ArsR, has been reported, affording the first biologically rel- 
evant thiolate mimic of an AsS, coordination’. Metal-binding ligands 
Glu and His have also been introduced by retrostructural analysis, 
whereby the active site of a protein can be separated into idealized sec- 
ondary structural components (for example an a-helix and a B-hairpin) 
and mathematically described as a root-mean-squared deviation from 
the idealized structure. This strategy led to the construction of dinu- 
clear metalloproteins in a-helical bundles that can bind two Zn(11), 
Co(11) or Fe(11), with the Zn(11) crystal structure (Fig. 1d) shown to be 
structurally analogous to the di-iron site of ribonucleotide reductase 
and ferritin’. One of these di-iron proteins was reported to catalyse 
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Figure 1| Designed metalloproteins using de novo-designed scaffolds. 

a, Computer model of a bis-histidine-ligated mono-haem a-helical bundle. 
The haem cofactor (central molecule) is in cyan, and the relevant haem- 
binding His ligands are shown in stick format, with nitrogen in blue and 
carbon in cyan. (Panel modified from ref. 6 courtesy of W. DeGrado.) 

b, Computer model of a bis-histidine-ligated multi-haem four-a-helical 
bundle. Haem cofactors are shown in red. Relevant haem-binding His 
ligands are shown in cyan, and the disulphide bond is shown in yellow. 


the two-electron oxidation of 4-aminophenol, similar to the reaction 
catalysed by di-iron phenol oxidases'*”. 

Motivated by the natural occurrence of Cys-rich coordination sites 
in proteins, the introduction of Cys ligands led to the development of 
a-helical coiled-coil metalloproteins capable of binding Co(1), Fe(11), 
Zn(11), Cd(a1), Hg(11) and As(11)”””. One interesting property of these 
de novo-designed proteins is that they can stabilize unusual metal-coor- 
dination states, for example three-coordinate Hg(11) stabilized by a tri- 
helical bundle, in preference to more normal coordination states, such as 
the bis-coordination often preferred by Hg(11)”*. In other cases, metal 
binding can actually direct the folding and assembly ofan a-helical bun- 
dle from either a random coil or a less-folded or misfolded state, as has 
been shown for Cd(m1)* and Hg(m)* binding to Cys-containing peptides 
predisposed to form helices. Interestingly, geometry-selective binding 
of two different Cd(11) atoms in one designed protein has been achieved 
and confirmed by the correlation of '*Cd NMR and ‘Cd perturbed- 
angular-correlation spectroscopy”. 

In stark contrast to the design of a-helical proteins, de novo design of 
6-structure proteins is still in its infancy, because there are few model 
systems with which to study B-sheets in isolation from other protein 
structures. The de novo design of a redox-active rubredoxin mimic, 
RM1, is a rare example of a structural and functional metallo-f-sheet 
protein”. RM1 was shown to bind iron and reversibly cycle between 
the Fe(11) and Fe(111) oxidation states, mimicking native rubredoxin. 
The RM1 £-hairpin structure was designed computationally by con- 
straining the positions of the peptide that were strategic in mimicking 
and/or stabilizing the B-structure and by subsequently using another 
computer program, SCADS”, to select the most probable amino acids 
for the remaining positions. 


Metalloprotein design in native protein scaffolds 

Designing metalloproteins using de novo-designed scaffolds offers the 
prospect of complete control over a protein's structure and metal-bind- 
ing properties, but our current knowledge of protein folding limits the 
number of de novo-designed scaffolds to only a few types, such as the 
a-helical bundles discussed above. It has been observed that proteins 
containing B-sheets tend to support more rigid and more pre-organized 
metal-binding sites than a-helical proteins, which are inherently more 
flexible”. The Protein Data Bank contains many more natural protein 
scaffold types (~1,000) than it does de novo-designed scaffolds (http:// 
scop.mrc-Imb.cam.ac.uk/scop/count.html#scop- 1.73), and most of the 
natural protein scaffolds maintain the same fold and are of a similar 
stability even after numerous mutations. Therefore, metalloprotein 
design using native protein scaffolds instead of those designed de novo 
provides many more scaffold choices that are more tolerant to muta- 
tions, allowing less concern over decreased protein stability when 
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(Panel modified from ref. 7 courtesy of P. L. Dutton.) ¢, X-ray crystal 
structure of As(111)-bound three-stranded coiled-coil protein (Protein Data 
Bank (PDB) code 2JGO)"”. As(111) is represented as a green ball; relevant 
metal-ion-binding Cys ligands are indicated in stick format, with sulphur in 
yellow and carbon in cyan. d, X-ray crystal structure of di-Zn(11) due ferro 

1 (PDB code 1EC5)””. Zn(11) ions are represented by grey balls; relevant 
metal-ion-binding His and Glu ligands are also indicated in stick format, 
with nitrogen in blue, oxygen in red, and carbon in cyan. 


incorporating metal-binding sites into proteins. When chosen care- 
fully, native proteins are also easier to crystallize than de novo-designed 
scaffolds, making three-dimensional characterization possible, which 
is critical to success. Perhaps the strongest argument for designing 
metalloproteins by using native scaffolds is that nature uses a similar 
approach — the same scaffold is often seen in numerous proteins with 
diverse metal-binding motifs and functions, suggesting that these native 
scaffolds are robust and modifiable. For example, the B-barrel fold has 
been shown to be used by ~600 types of protein, with diverse functions 
such as oxidase, reductase, amylase and dismutase activities. Discover- 
ing how nature is able to use the same scaffold to design a variety of 
metal-binding sites is an important goal of metalloprotein design’. 

Biochemical techniques, such as site-directed mutagenesis, have 
been used extensively to study the function of metalloproteins. The 
loss of function accompanied by certain mutations (usually to highly 
conserved residues) allows the identification of residues essential for 
function. Although serving a different purpose, the same mutagenesis 
techniques can be used in metalloprotein design to impart new function 
into a protein scaffold by introducing residues that bind metal ions. This 
can be accomplished through the redesign of existing metal-binding 
sites to introduce novel functionality, the introduction of mononuclear 
metal-binding sites into proteins that do not bind metal ions natively, 
or the introduction of homonuclear or heteronuclear metal-binding 
sites into proteins. Design is often aided by empirical approaches based 
on prior knowledge and experience, the use of computer programs 
(rational design) or combinatorial selections. 

The redesign of an existing metal-binding site to introduce new 
function or metal specificity is the simplest form of metalloprotein 
design, as this approach relies on the structural differences between 
template and target proteins and is amenable to empirical approaches 
as mentioned above. Despite its simplicity, this approach is still power- 
ful in elucidating structural features important for the change in, or 
gain of, function by redesign. Haem proteins have been extensively 
redesigned in this way. They are one of the most diverse classes of 
protein, having functions ranging from electron transfer to small- 
molecule (for example oxygen and nitric oxide) transport, sensing 
and activation’***’. By systematically changing the characteristics 
of haem proteins, researchers have been able to convert one type of 
haem protein into another and to introduce new function or substrate 
specificity into a haem protein? ™. 

The selectivity of designed metalloproteins for specific metals has 
also been used for the purpose of metal-sensing applications. Ratiomet- 
ric Zn(11) sensors have been created by designing Zn(11)-binding sites 
into green fluorescent proteins and by taking advantage of the resultant 
fluorescent signal variations due to Zn(11)-dependent conformational 
changes***’. The metal-binding site of NikR, a DNA-binding protein, 
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has been redesigned to bind UO,” rather than its native Ni** cofactor® 
(Fig. 2a), resulting in an artificial protein that binds DNA only in the 
presence of UO,”. In addition to sensors, designed metal-binding sites 
have allowed modulation of protein-protein interactions””°, which 
are at the heart of many diverse biological functions. Because of com- 
plicated non-covalent interactions, it has been difficult to control the 
protein-protein interactions well. These designed metal-binding sites 
allow specific control over them. 

More complex redesign of metal-binding sites may require rational 
design using computational biology, such as the simultaneous incor- 
poration and adjustment of functional elements to convert the Gly II 
enzyme from a glyoxalase, which hydrolyses thioester bonds, to a 
B-lactamase”’. In this case, to impart the new functionality, the metal- 
binding site had to be significantly redesigned to change the metal- 
binding geometry, as well as the substrate-binding pocket. 

Although redesign of metal-binding sites can offer insight into the 
different structural features possessed by the template and target pro- 
teins, it cannot tell us about the structural features common to the 
metal-binding sites of both proteins. Introducing metal-binding sites 
into a protein location where no native metal-binding site is found 
can overcome this limitation, although this approach is more com- 
plex. A common strategy is to base the design on structural homol- 
ogy between the template protein (which contains no metal ions) and 
the target metalloprotein, in order to introduce metal-binding sites 
into the template protein at the positions corresponding to those in 
which they are found in the metalloprotein. Using this strategy, new 
Zn(11)-binding sites have been introduced into charybdotoxin” and 
retinol-binding protein® to mimic carbonic anhydrase. 

When there is no structural homology, computer-aided design using 
programs such as METAL-SEARCH™ and DEZYMER* has become 
indispensable. A Ca(11)**, a Pb(1)” and an [Fe(Cys)4] * rubredoxin 
centre have all been designed into scaffold proteins using computer- 
aided design. In the case of an artificial rubredoxin site designed into 
the non-metalloprotein thioredoxin, it was found to undergo several 
rounds of air oxidation and reduction by -mercaptoethanol, similar 
to native rubredoxins”. 

Even more difficult in metalloenzyme design is the introduction 
into proteins of dinuclear metal-binding sites or metal clusters. An 
example is the binuclear Cu, centre, consisting of two copper ions 
bridged by two Cys thiolates, with each copper ion also coordinated 
to a His nitrogen (Fig. 2b). New Cu, centres were designed into the 
cupredoxins azurin*” and amicyanin®””, using a technique called 
loop-directed mutagenesis, in which the copper-binding loop of a 
cupredoxin is replaced by the corresponding loop in cytochrome c 
oxidase (COX; also known as CcO), which has similar structural 
homology. Loop-directed mutagenesis has also been used to introduce 
lanthanide-binding sites into a helix-turn—helix motif by drafting a 
calcium-binding EF-hand loop, and such a designed protein has been 
shown to promote DNA and phosphate hydrolysis”. 

Heteronuclear bimetallic centres are also found in many natural 
proteins, and perform important biological functions such as oxygen 
reduction, lignin degradation and nitric oxide reduction. Designing 
metalloproteins to mimic the enzymes that carry out these complex 
reactions is even more challenging than designing homonuclear metal- 
binding sites because of the need to selectively place and bind two or 
more different metal centres. An example is the haem—Cu, centre in 
COX, where three His molecules bind a Cu(1) ion that is ~4.5 A away 
from the haem centre. Computer-aided design was used to introduce 
the His molecules above the haem of myoglobin at positions corre- 
sponding to those in COX” (Fig. 2c). Copper binding was confirmed 
by multiple spectroscopic techniques. Further studies showed the 
importance of the Cu, site in O, binding and reduction, as well as the 
significance of a proton network™. This designed protein also displays 
interesting functional properties, such as nitric oxide reduction”, simi- 
lar to certain members of the COX family. A Mn(11)-binding site has 
also been designed into cytochrome c peroxidase, resulting in a haem- 
Mn(t11) protein that structurally and functionally mimics manganese 
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peroxidase’. Manganese peroxidase is a protein involved in lignin bio- 
degradation, which is a critical step in biomass conversions from plant 
materials to biofuels. 

In addition to discrete metal-binding sites, metal nanoparticles 
have been introduced into proteins”, including platinum nanopar- 
ticles into ferritin’ and a small heat-shock protein”. The former 
catalyses size-selective alkene hydrogenation and the latter catalyses 
hydrogen production. 

Although approaches based on empirical knowledge and rational 
design using computer programs have been relatively successful in 
designing metalloproteins, it is still difficult to design certain met- 
alloproteins, especially functional ones, owing to the large number 
of potential interactions within the active site of a metalloenzyme. 
In this case, combinatorial design and directed evolution of new 
metalloenzymes is another powerful method. For example, directed 
evolution has been extensively applied to antibodies to develop new 
metalloenzymes. Using phage display, the 7G12 antibody protein 
was evolved to bind specifically to haem, with which the oxidation 
of tyramine could be catalysed®. A similar technique has recently 
been used to find antibodies specific for inorganic ligands, such as 
2,2'-bis(diphenylphosphino)-1,1’-binaphthyl (BINAP) and 1,1’-bi- 
2-naphthol (BINOL), with the goal of developing completely new 
metalloenzymes™. Another striking example of this approach is the 
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Figure 2 | Designed metalloproteins using native scaffolds. a, Native nickel- 
binding site of NikR (left) and the redesigned UO,”*-binding site of the NikR 
variant (right). b, Scheme showing the replacement of the copper-binding 
loop of blue copper azurin (blue; PDB code 4AZU) with the copper-binding 
loop from the Cu,-containing subunit of COX (purple) to yield the dinuclear, 
purple Cu, azurin construct (PDB code 1CC3). ¢, Catalytic haem—Cuy, centre 
in COX (left; PDB code 1AR1) and the designed haem-copper model in 
sperm whale myoglobin (right). For panels b and ¢, active-site nitrogen is 
shown in blue, oxygen in red, carbon in cyan, sulphur in yellow and iron in 
brown; copper is shown in purple or blue (to match the protein) in b and 
green inc. 
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use of directed evolution to obtain a new form of cytochrome P450 
with the ability to hydroxylate propane selectively, a function that has 
not yet been achieved using small-molecule catalysts®. 


Designing unnatural metalloproteins 

Metalloprotein design using de novo-designed or native protein scaf- 
folds has succeeded in providing valuable insights into the funda- 
mental rules that govern protein structure and function. Replicating 
native protein structure and function, although challenging, is not 
the only goal of protein designers; they want to build proteins with 
properties that exceed those found in nature”. In recent years, the 
repertoire of functional groups such as amino acids or cofactors avail- 
able to protein designers has been expanded beyond that available in 
biological systems. 

A myriad of techniques have been developed for the incorpora- 
tion of unnatural amino acids into a protein scaffold, each with its 
own advantages and limitations”. Total synthesis of proteins using 
solid-phase peptide synthesis allows the incorporation of any syn- 
thetic amino acid for which successful protection and deprotection 
techniques can be applied™, but cost and size constraints limit the use 
of total synthesis to smaller proteins (~60-100 amino acids). Native 
chemical ligation, involving covalent attachment of two synthetic 
peptides, has expanded this size®’, but cost is still an issue, at least 
for very large proteins. Expressed-protein ligation, which involves the 
coupling of a bacterially expressed peptide fragment with a synthetic 
peptide fragment containing unnatural amino acids, has reduced the 
cost and extended the protein size even further, but it often requires 
a Cys residue at the point of ligation to maximize ligation efficiency® 
(methods have been developed to use other residues at the point of 
ligation, but with less efficiency). Methods not restricted by protein 
size include auxotroph generation”, chemical residue modification® 
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Figure 3 | Site-specific incorporation of unnatural amino acids into a protein 
scaffold for the tuning of metalloprotein functional properties. a, Crystal 
structure of rubredoxin (PDB code 1CAA), showing iron (green) and 

four Cys residues (in stick format). The -OH group in the highlighted 
phenylalanine has been replaced with an -H, -F-, -CN- or -NO, group”. 

b, Crystal structure of azurin (PDB code 4AZU), showing copper (brown) 
and one Cys, one Met and two His residues (in stick format). The highlighted 
Cys has been replaced with selenocysteine”, and the highlighted Met has been 
replaced with selenomethionine, oxomethionine, norleucine and di- and 
trifluoromethionine”. For both panels, active-site nitrogen is shown in blue, 
oxygen in red, carbon in cyan and sulphur in yellow. 
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and cavity complementation™, but these methods have limited ver- 
satility in terms of the variety of unnatural amino acids that can be 
incorporated. A promising and versatile technique is the use of custom 
evolved transfer-RNA synthetase molecules for in vitro and in vivo 
translation of new residues through recognition of the amber stop 
codon”. A major contribution resulting from this approach was the 
design of a DNA-cleaving protein through site-specific genetic incor- 
poration of the DNA-cleaving agent (2,2’-bipyridin-5-yl)alanine into 
a DNA-binding protein”. On addition of Cu(1) or Fe(11) and reducing 
agent to the protein, a double-stranded DNA substrate was cleaved at 
its consensus sequence. The variety of unnatural amino acids that this 
technique can incorporate is greater than that which can be used in 
auxotroph generation, chemical residue modification or cavity com- 
plementation, but less than that which can be used in total synthesis, 
native chemical ligation or expressed-protein ligation. 

The expansion of the amino-acid regime has made significant con- 
tributions to the systematic determination of the role of each amino 
acid in tuning the activity of the metal centre to a given reactivity, and 
in increasing the breadth of chemical transformations performed by 
proteins. For instance, replacement of haem-binding His residues in 
de novo-designed four-a-helical bundles with 4-B-(pyridyl)-L-alanine 
and 1-methyl-1-histidine has resulted in an increase in the reduction 
potential of the metal site for the pyridyl system and a preference for 
five-coordinate low-spin haem in the 1-methyl-t-histidine system”. 
In another system, introduction of bulky penicillamine ligands in place 
of Cys molecules led to the creation of an open metal-binding site in a 
ferredoxin model peptide’. This methodology resulted in the isolation 
of the first water-stable CdS, complex through subtle manipulation 
of steric crowding around the metal centre”. Furthermore, substitu- 
tion of L-amino acids for their D-enantiomers resulted in fine control 
between three- and four-coordinate Cd(11) complexes in a de novo- 
designed metallopeptide”. Finally, the role of a conserved Tyr residue 
near the active site of rubredoxin was probed using a set of unnatural 
Tyr analogues in which the para -OH group was replaced with -H (as 
in native Phe), -F, -CN and -NO, groups, spanning a range of electron 
withdrawing strengths” (Fig. 3a). Interestingly, a linear correlation was 
found between the reduction potential of the rubredoxin and the Ham- 
mett o,, value, but no such trend was observed with dipole moment or 
para-substituent size. 

A powerful advantage of introducing unnatural amino acids is the 
ability to probe the role of the amide backbone, because site-directed 
mutagenesis using natural amino acids cannot modify the protein 
backbone. Both the carbonyl and nitrogen functionalities of the pro- 
tein backbone have been implicated in critical and specific hydrogen- 
bonding interactions, either directly to the metal or to the metal ligands. 
For example, the Fe,S, cluster of high-potential iron proteins has such 
an amide interaction, which was found to be responsible for stabiliz- 
ing the reduced form through attenuation of a metal-bound sulphur”. 
Replacement of this amide linkage with an ester linkage indeed resulted 
in a lowering of the reduction potential by ~100 mV. 

Through the use of expressed-protein ligation, the metal-bound Cys 
and Met residues in the electron-transfer protein azurin were replaced 
with isostructurally similar unnatural amino acids” (Fig. 3b). Replac- 
ing Cys with selenocysteine resulted in significantly different spectral 
features, but did little to affect the reduction potential. By contrast, 
replacing methionine with selenomethionine and other isostructural 
variants resulted in spectral features similar to those of the wild-type 
protein, but with significantly different reduction potentials. The use 
of unnatural amino acids helped to identify the hydrophobicity of the 
residues at this position as the major factor in tuning the reduction 
potential of the protein, resulting in new metalloproteins with reduc- 
tion potentials beyond those found in native azurin and its variants. 

The repertoire of non-native functionalities has also been expanded 
through the incorporation into the protein scaffold of metal-containing 
cofactors not found in nature. Biological systems incorporate metal 
cofactors such as haem by both non-covalent and covalent means. Of 
the covalent approaches in nature, both single-point and dual-point 
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Figure 4 | Examples of strategies for the incorporation of non-native metal 
cofactors into protein scaffolds. a, Myoglobin (protein backbone shown in 
green) in which native haem b has been replaced non-covalently with iron 
porphycene (stick and ball illustration); generated from the structure with 
PDB code 2D6C (ref. 82). Iron is shown as a brown sphere. b, Myoglobin 
(protein backbone in orange) in which native haem b has been replaced 
non-covalently with Cr(3,3’-Me,-salophen) (stick and ball illustration); 
generated from the structure with PDB code 1J3F (ref. 86). Chromium is 
shown as a pale pink sphere. c, Streptavidin (protein backbone in red) into 
which biotinylated n°-(benzene)RuCl (stick and ball illustration) has been 
incorporated non-covalently; generated from the structure with PDB code 


anchoring strategies are used to position the prosthetic group within 
the protein scaffold. These strategies have been used for the incorpo- 
ration of non-native metal cofactors, with the non-covalent approach 
being the most extensively used strategy. For example, replacement of 
the native haem cofactor in myoglobin with less symmetrical structural 
isomers such as porphycene (Fig. 4a) has resulted in new or enhanced 
O,-binding affinity or O,-activation activities*’. In addition to native 
protein scaffolds, de novo-designed a-helical bundles have also been 
engineered to bind non-biological haem cofactors in preference to bio- 
logical haems*™’. 

In addition to haem analogues, other planar moieties similar to haem 
have been substituted into haem proteins. For example, strong bind- 
ing of manganese, iron and chromium Schiff-base complexes into a 
myoglobin scaffold was observed after careful design of the protein to 
relieve steric repulsion in the active site”’***° (Fig. 4b). Such rational 
design has resulted in new asymmetrical catalysts capable of oxidation- 
rate enhancement and chiral induction. 

It is much more challenging to incorporate non-planar metal-con- 
taining cofactors into proteins in which there is no native binding site. 
One way to do so is through derivatization of the catalyst with a moiety 
exhibiting natural bio-affinity for a protein target, through, for example, 
the use of the high-affinity biotin-streptavidin couple (association con- 
stant, K,=~10'°M’’). In this way, metal cofactors such as a ruthenium 
catalyst have been incorporated into the streptavidin scaffold through 
covalent attachment to a biotin moiety” ® (Fig. 4c), resulting in artifi- 
cial biocatalysts with new activities, including hydrogenation of alkenes, 
transfer hydrogenation of ketones, and carbon-carbon bond formation 
through allylic alkylation (with up to 99% enantiomeric excess)**””. 

Although the use of biotin and avidin/streptavidin to incorporate 
non-native metal cofactors is powerful, not all cofactors are amenable 
to such a strategy, as binding affinity and location may not be readily 
controlled. Covalent attachment is an effective and versatile method for 
linking a cofactor to the protein at a defined location through selective 
conjugation techniques involving engineered or native Cys or Lys resi- 
dues. This strategy has been used to attach DNA-cleaving agents such as 
1,10-phenanthroline-copper and iron-EDTA to DNA-binding proteins 
for the generation of artificial nucleases”, and in the case of 1,10-phen- 
anthroline-copper (Fig. 4d) for enantioselective hydrolysis”. 

The single-point covalent approach mentioned above makes it pos- 
sible to attach a non-native cofactor at a defined location, but it does 
not necessarily fix the conformation of the cofactor inside the protein. 
For example, single covalent attachment of a 4-(2-methanesulphonyl- 
thio-ethoxy)salicylidene-1,2-ethanediamino-manganese(111) (Mn- 
salen) cofactor to apo myoglobin through a single Cys-selective linker 


2QCB (ref. 91). Ruthenium is shown as a black sphere. d, Adipocyte 
lipid-binding protein (protein backbone in grey) to which 
1,10-phenanthroline (stick and ball illustration) has been attached 
covalently through a single point, Cys 117; generated from the structure 
with PDB code 1A18 (ref. 94). e, Myoglobin (with Leu72Cys and 
Tyr103Cys) (protein backbone in magenta) in which a Mn-salen complex 
(stick and ball illustration) is anchored covalently at two points; generated 
from native myoglobin structure and structural modelling”. Manganese 
is shown as an orange sphere. For all active-site stick and ball illustrations, 
nitrogen is shown in blue, oxygen in red, carbon in cyan and sulphur 

in yellow. 


resulted in only 12% enantiomeric excess towards sulphoxidation of 
thioanisole”, suggesting that the Mn-salen conformation may be too 
flexible inside myoglobin to facilitate high enantioselectivity. Addition 
of a second linker (Fig. 4e) to the cofactor together with selective cavity 
positioning led to a significant increase in catalytic efficiency (more 
than a sevenfold increase) and enantioselectivity (increased from 12% 
to 88%)”**’. Furthermore, it has been shown that such incorporation of 
Mn-salen into a myoglobin scaffold significantly enhances the chemo- 
selectivity in oxidation of thioanisole with H,O,, almost eliminating 
over-oxidation to the sulphone product through electrostatic exclusion 
of the sulphoxide product from the site”’. 


Lessons learned and future challenges 

Tremendous progress has been made in this exciting field of synthetic 
biology. Success is measured not only by how closely the designed 
protein resembles the native protein, but also by the insights gained 
from the process. For example, the crystal structure of a designed Cu, 


Cu, azurin Native COX 


Cu, azurin 


Native COX 

Figure 5 | Close match between a designed metalloprotein and its native 
target protein. Crystal structures of a designed Cu, site in azurin (left; 
PDB code 1CC3)” and a Cu, site in native COX (right; PDB code 1AR1) as 
viewed from above the Cu,(S,,,), plane (a) and along the Cu,(S.,,), plane (b). 
Active-site nitrogen is shown in blue, oxygen in red, carbon in cyan, sulphur 
in yellow and copper in purple. 
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site is almost identical to that of native Cu,, including both primary 
and secondary coordination spheres” (Fig. 5), indicating that metallo- 
protein design can lead to structures closely resembling those in nature. 
More importantly, two metal-binding sites, native blue copper and the 
designed Cu, centre, were placed in the same azurin scaffold, allowing 
the electron-transfer properties of two different metal centres to be 
compared in the same protein” — a task that is difficult, if not impos- 
sible, to do by studying native proteins. Such a comparison provided 
deeper insight that the Cu, centre is a more efficient electron-transfer 
centre than the blue copper centre”. 

Given the initial successes, now is a good time to summarize the 
lessons learned and to recognize future challenges. Essential to suc- 
cess is the recognition that structural biology must have a major role 
in the design process. Some early metalloprotein designs generated 
much excitement in the field, only to be shown by X-ray crystallogra- 
phy to contain a different metal-binding site than originally designed. 
Although spectroscopy is invaluable in providing support for design, 
more concrete evidence from three-dimensional structural charac- 
terization is essential. In addition to confirming the initial design, it 
provides detailed information on what has or has not been designed 
correctly, thus providing insights for the next level of design. 

Furthermore, design of the primary coordination sphere is often not 
enough to confer structure and/or function. Consideration of the envi- 
ronment surrounding the primary coordination sphere is critical to suc- 
cessful design. One main example is water molecules found at specific 
locations inside the protein, where the associated hydrogen-bonding 
interactions have essential roles in many metalloproteins, including 
cytochrome P450, haem oxygenase (HO) and COX”. For example, 
even though a Cu, centre that structurally mimics the haem-copper 
centre in COX was designed into myoglobin”, its function mimics HO 
rather than COX when O, is used as an oxidant™. When H,O, was used 
as an oxidant, a ferryl species, a putative intermediate in COX activity, 
was observed. These results suggest that the extra protons introduced 
through H,O, are important for function™. The introduction of an 
additional hydrogen-bonding donor through the replacement of the 
type-b haem in myoglobin with a type-o haem, resembling the haem 
found in COX, allowed suppression of HO activity and promotion of 
COX activity”. 

Metalloprotein design has focused mainly on the positive design 
of structural features responsible for correct metal-site geometry and 
function. However, negative design to prevent competing metal-site 
geometries and unwanted functions is also critical. For example, in 
designing dinuclear metalloproteins in a helical bundle, there was 
a possibility that undesirable heterotetrameric motifs and homo- 
oligomers would form because the protein comprised different pep- 
tides. To circumvent this problem, a computational algorithm was 
included to destabilize these interactions”. 

The final lesson is that metalloprotein design often results in lower 
stability because of changes made to the protein scaffold. Sometimes, 
the stability is so low that even if the design is valid, not enough stable 
protein can be obtained to prove the design. The stability of the protein 
must be improved, even if function may not be improved directly. A 
good example of this was demonstrated in cytochrome P450: only after 
the protein was stabilized by further mutations could the initial design 
confer new activity’. 

The majority of the successes in the field of metalloprotein design 
are derived from the construction of a geometrically correct, sterically 
compatible primary coordination sphere sufficient to reproduce the 
structure and function of a desired metal centre, such as a bis-histidine 
haem or His,Cys, tetrahedral Zn(11) site. However, many metal-binding 
sites in proteins, such as the type-1 copper and Cu, centres, are not 
necessarily in their preferred metal-ion geometric states. Therefore, 
one future challenge is to design metal sites and metal clusters with 
unique geometric requirements. Other challenges include the design 
of metal sites at the interface of two or more proteins, which occur 
often in proteins but have not been designed until recently’, and 
the design of metal-binding sites from membrane proteins, which has 
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also been rarely achieved. A greater challenge is the design of protein 
metal-binding sites requiring helpers called metallochaperones. Con- 
sideration of the interactions between metal ions and protein hosts is 
not enough to confer function, making their interactions with metallo- 
chaperones necessary in the design process. In addition to metal-bind- 
ing-site design, functional metalloproteins also require the design ofa 
substrate-binding site for catalysis. A higher level of complexity involves 
coupling the redox reaction ofa designed metal-binding site to proton 
transfer, conformational change or charge separation. 

To meet these challenges, advances ina number of fields are required. 
The most important requirement is the development of better computer 
programs for designing metal-binding sites in proteins, including consid- 
eration of structural features beyond the primary coordination sphere. 
In contrast to well-defined force fields for peptide bonds and amino- 
acid side chains, most force fields for metal-binding sites are ill-defined 
because there are many more variables involved in defining a metal-bind- 
ing site. These variables include variation in the identity of the metal ion, 
different oxidation states of the same metal ions and different ligand and 
geometric preferences of different metal ions. Density functional theory 
has been an excellent tool in defining small metal complexes. Extending 
its success to metal-binding sites in proteins, together with other leading 
algorithms, such as QM/MM (quantum mechanics/molecular mechan- 
ics), is required for the successful rational design of metalloproteins. 
Despite these challenges, metalloprotein design will remain a vibrant 
field in which researchers will continue to design metalloproteins with 
novel functions and gain new insights. a 
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Chd1 regulates open chromatin and 
pluripotency of embryonic stem cells 


Alexandre Gaspar-Maia’’, Adi Alajem’, Fanny Polesso'’”, Rupa Sridharan”, Mike J. Mason’, Amy Heidersbach”, 
Joao Ramalho-Santos®, Michael T. McManus’, Kathrin Plath’, Eran Meshorer* & Miguel Ramalho-Santos’” 


An open chromatin largely devoid of heterochromatin is a hallmark of stem cells. It remains unknown whether an open 
chromatin is necessary for the differentiation potential of stem cells, and which molecules are needed to maintain open 
chromatin. Here we show that the chromatin remodelling factor Chd1 is required to maintain the open chromatin of 
pluripotent mouse embryonic stem cells. Chd1 is a euchromatin protein that associates with the promoters of active genes, 
and downregulation of Chd1 leads to accumulation of heterochromatin. Chd1-deficient embryonic stem cells are no longer 
pluripotent, because they are incapable of giving rise to primitive endoderm and have a high propensity for neural 
differentiation. Furthermore, Chd1 is required for efficient reprogramming of fibroblasts to the pluripotent stem cell state. 
Our results indicate that Chd1 is essential for open chromatin and pluripotency of embryonic stem cells, and for somatic cell 


reprogramming to the pluripotent state. 


The genome of eukaryotic cells is organized into accessible euchroma- 
tin that is permissive for gene activation, and packaged heterochro- 
matin that is largely silenced. Different cellular states may be defined at 
least in part by differential allocation of genomic regions to specific 
chromatin domains’. Several types of stem cells in organisms ranging 
from planarians* to mammals** have been reported to have an open 
chromatin largely devoid of heterochromatin. This phenomenon has 
been analysed in greater detail in pluripotent mouse embryonic stem 
(ES) cells. These cells have an open, ‘loose’ chromatin with high rates of 
histone protein exchange, and accumulate regions of more rigid het- 
erochromatin after differentiation®®. An open chromatin correlates 
with a globally permissive transcriptional state, and has been proposed 
to contribute to the developmental plasticity, or pluripotency, of ES 
cells®. Although there is a strong correlation between open chromatin 
and the undifferentiated state of stem cells, it remains unknown 
whether open chromatin is necessary for stem cell potential. 
Furthermore, little is known about the molecules that may regulate 
open chromatin in stem cells. Here we report the identification of the 
chromatin remodeller Chd1 as an essential regulator of open chro- 
matin and pluripotency of ES cells, and of somatic cell reprogramming 
to pluripotency. 


Chd1 regulates ES cell self-renewal 


We have recently characterized the transcriptional profiles of pluri- 
potent stem cells, including ES cells, and the cells in the mouse embryo 
from which they are derived (ref. 7 and G. Wei, R.-F. Yeh, M. Hebrok 
and M.R.-S., unpublished observations). These studies led to the iden- 
tification of chromatin remodellers and transcription factors up- 
regulated in pluripotent cells. To test the role of 41 candidate factors 
in the regulation of pluripotency, we carried out an RNA interference 
(RNAi) screen in ES cells (Supplementary Fig. 1). ES cells expressing 
green fluorescent protein (GFP) under the control of the Oct4 
promoter (Oct4-GiP) were infected with a short hairpin RNA 


(shRNA)-expressing lentiviral vector, pSicoR-mCherry*. Using 1-2 
shRNAs per candidate target gene, we identified 18 genes that when 
downregulated led to defects in expansion of ES cells, and 7 that led to 
lower activity of the Oct4 promoter. Chd1 was the only gene with 
phenotypes in both assays that had not been previously implicated 
in ES cells (Supplementary Fig. 1). 

Chd1 is a chromatin-remodelling enzyme that belongs to the chro- 
modomain family of proteins and contains an ATPase SNF2-like 
helicase domain’. The two chromodomains in Chd1 are essential 
for recognition of histone H3 di- or tri-methylated at lysine 4 
(H3K4me2/3; ref. 10), and Chd1 has been implicated in transcrip- 
tional activation in yeast'', Drosophila’? and mammalian cells’’. 
Recent transcription factor location studies indicate that the Chd1 
gene is bound in mouse ES cells by Oct4 (also known as Pou5f1), 
Sox2, Nanog, Smad1, Zfx and E2fl1, suggesting that it is a target of 
several regulators of pluripotency and self-renewal". 

RNAi against Chd1 in Oct4-GiP ES cells, using two independent 
shRNAs targeting different regions of the messenger RNA, led to a 
decrease in the expansion of ES cells and to lower Oct4—GFP levels 
(Fig. la, b and Supplementary Fig. 2a). Control cells were infected with 
empty pSicoR-mCherry or with pSicoR-mCherry expressing an shRNA 
targeting GFP (empty and GFP RNAi, respectively), and behaved like 
uninfected cells (Supplementary Figs 1d and 2a). Downregulation of 
Chd1 mRNA after RNAi was confirmed by reverse transcription 
followed by quantitative PCR (qRT-PCR) (Supplementary Fig. 2b). 
Endogenous Oct4 downregulation was confirmed in Chd1-deficient 
(Chd1 RNAi) ES cells (Supplementary Fig. 3a). Oct4 downregulation 
induced differentiation into the trophectoderm lineage!’ (marked by 
Cdx2 and Eomes), unlike knockdown of Chd1, indicating that the Chd1 
RNAi phenotype is not simply one of trophectoderm differentiation 
due to the loss of Oct4 (Supplementary Fig. 3). 

Chd1 downregulation decreased clonogenic potential in two inde- 
pendent ES cell lines (Oct4-GiP and E14), but Chd1 RNAi cells were 
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Figure 1| Chd7 RNAi ES cells have decreased self-renewal but maintain 
expression of markers of the undifferentiated state. a, A competition assay 
shows that downregulation of Chd1 in Oct4-GiP ES cells using two 
independent shRNAs (Chd1 RNAil and Chd1 RNAiA4) leads to a decreased 
proliferative capacity. The proliferation index represents the population of 
mCherry~ cells relative to cells infected with empty vector, at each time 
point. Experiments were done in triplicate and are represented as 

mean and s.d. (n = 3). b, Downregulation of Chd1 leads to reduced activity 
of the Oct4—GFP reporter, as measured by the ratio between the percentage 
of GFP-negative cells in RNAi and in empty vector. Chd1 RNAi cells have a 
twofold increase of GFP-negative cells relative to controls for at least one 
passage. Downregulation of a different gene that affects only proliferation 


still able to form ES-like colonies (Fig. 1c), unlike Oct4 RNAi ES cells. 
ES cell clones constitutively expressing either of the two shRNAs 
against Chd1 were established, and sustained Chd1 downregulation 
was verified by qRT-PCR (see below Supplementary Fig. 5) and 
western blot (Fig. 1d). Control lines were established using empty 
and GFP RNAi viruses. As described later, the two shRNAs targeting 
Chd1 led to identical phenotypes in marker gene expression, tran- 
scriptional profile, differentiation potential and chromatin state, rela- 
tive to controls. Results were validated in the two independent ES cell 
lines Oct4-GiP and E14. The data are from analyses in standard E14 
ES cells not expressing GFP. Chd1 RNAi ES cells, even though they 
have a self-renewal defect, form compact colonies and express 
markers of ES cells, such as SSEA1, alkaline phosphatase and Oct4 
(Fig. le), indicating that they maintain at least some aspects of the 
undifferentiated state. 


Chd1 is required for ES cell pluripotency 

To gain insight into the state of Chd1 RNAi ES cells, we determined 
their global gene expression profiles using Affymetrix mouse Gene 1.0 
ST microarrays (Supplementary Fig. 4). We anticipated that we would 
find a pattern of downregulated genes in Chd1 RNAi cells, because 
Chd1 is known to be associated with active transcription’. As 
expected, both Chd1 and Oct4 were found to be downregulated in 
Chd1 RNAi ES cells. Surprisingly, however, very few other genes were 
significantly downregulated (only 25 genes were downregulated more 
than twofold and none more than threefold at 90% confidence, 
Supplementary Fig. 4b and Supplementary Data 1). These data indicate 
that, at least with the low levels of Chd1 still present in Chd1 RNAi ES 
cells, there is a global maintenance of the ES cell transcriptome. On the 


864 


Chd71 RNAi4 


NATURE| Vol 460|13 August 2009 


e 
GFP RNAi i 
bs sR, oy 


(Cebpz) has little effect on Oct4—GFP expression. Data are mean and s.d. 
(n = 3). ¢, The efficiency of colony formation in Chd1 RNAi ES cells was 
decreased relative to controls (empty and GFP RNAi). Data are mean and 
s.d. (n = 3), and are representative of two independent experiments. 

d, Immunoblot with whole cell extracts from parental E14 cells, GFP RNAi 
and two Chd1 RNAi ES cell clones (C1i6 and C4i2) using antibodies against 
Chd1 or o-tubulin. e, Chd1 RNAi cells still express markers of 
undifferentiated ES cells, such as alkaline phosphatase (AP; shown in bright 
field; original magnification, X 100), and SSEA1 and Oct4 (shown with 
immunofluorescence; original magnification, 400). Nuclei were stained 
with 4,6-diamidino-2-phenylindole (DAPI). 


Oct4 DAPI SSEA1 AP staining 


DAPI 


other hand, a larger group of genes was upregulated in Chd1 RNAi ES 
cells (Supplementary Fig. 4b and Supplementary Data 1). A Gene 
Ontology (GO) analysis of the list of upregulated transcripts showed 
a significant enrichment for genes involved in neurogenesis 
(Supplementary Fig. 4c), such as nestin and Blbp (also known as 
Fabp7), which was confirmed by qRT-PCR (Supplementary Fig. 5d). 

The maintenance of the ES cell transcriptome and the unexpected 
expression of neural markers were further analysed by immunofluor- 
escence for nestin, Blbp and Oct4. Oct4 was detected in Chdl RNAi 
colonies, but cells between the colonies stained strongly for Blbp 
(Fig. 2a) and nestin (Supplementary Fig. 4e). No staining for nestin 
or Blbp was detected in control ES cells. These results were confirmed 
with fluorescence-activated cell sorting (FACS) using the ES cell mar- 
ker SSEA1 (Supplementary Fig. 5). In summary, Chd1 RNAi ES cells 
can be propagated with many of the hallmarks of the undifferentiated 
state, but have a high propensity for neuronal differentiation. 

We next tested the differentiation potential of Chd1 RNAi ES cells 
in vitro by the formation of embryoid bodies. Chd1 RNAi embryoid 
bodies did not form the typical outer layer of primitive endoderm, as 
marked by immunofluorescence of embryoid body sections with Afp 
and Gata4 (Fig. 2b). Similarly, yolk sac endoderm cysts were reduced 
or not observed in Chd1 RNAi embryoid bodies (Supplementary Fig. 
6a), which showed downregulation of primitive endoderm markers 
(Gata4, Afp, Hnf4a and Lamb) by qRT-PCR (Supplementary Fig. 
6b). The loss of primitive endoderm in Chd1 RNAi embryoid bodies 
was comparable to that in embryoid bodies lacking an essential regu- 
lator of primitive endoderm, Gata6 (ref. 16) (Fig. 2b). Beating foci, 
indicative of cardiac mesoderm differentiation, were not detected 
in Chd1 RNAi embryoid bodies, whereas they could be readily 
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Figure 2 | Chd1is required for ES cell pluripotency. a, Immunofluorescence 
analysis of Chd1 RNAi cells shows expression of Blbp (red), but in a 
population not expressing the ES cell marker Oct4 (green). Original 
magnification, <400. b, Paraffin sections of 6 day embryoid bodies were 
stained for Afp (red) and Gata4 (green), or haematoxylin and eosin (H&E). 
The loss of the primitive endoderm layer (highlighted in control GFP RNAi 
by white arrowheads) in Chd1 RNAi embryoid bodies is similar to that 


quantified in control embryoid bodies (Supplementary Fig. 6c). 
However, beating foci were also not observed in Gata6é ’~ embryoid 
bodies, indicating that the loss of cardiac mesoderm differentiation 
in Chdl RNAi embryoid bodies may be secondary to the loss of 
primitive endoderm. Immunostaining of Chdl RNAi embryoid 
bodies plated on matrigel showed a marked increase of neurons 
(stained with Tujl) and astrocytes (stained with Gfap) relative to 
controls (Fig. 2c). This increase in neural differentiation is not sec- 
ondary to the loss of primitive endoderm, because Gataé ’~ embry- 
oid bodies did not show such phenotype (Fig. 2c). Furthermore, 
Chd1 RNAi ES cells gave rise to teratomas with abundant neuronal 
differentiation when compared to wild-type ES cells (Supplementary 
Fig. 7). These results indicate that downregulation of Chd1 leads to 
loss of primitive endoderm, with consequential loss of cardiac meso- 
derm differentiation, and abnormally high levels of neural differenti- 
ation that derives from a propensity already detected in the 
undifferentiated state. 


Chd1 is a euchromatin protein in ES cells 


We then sought to understand the potential changes in the chromatin 
state of Chd1 RNAi ES cells that may underlie their differentiation 
defects. Previous studies'’* indicated that Chd1 associates with 
euchromatin by binding to H3K4me3, although genome-wide loca- 
tion studies had not been performed. We carried out chromatin 
immunoprecipitation (ChIP)-chip for Chd1 in wild-type ES cells, 
and compared the genome-wide location of Chdl to that of 
H3K4me3, RNA polymerase II (Pol II) and H3K27me3. These data 
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observed in embryoid bodies mutant for Gata6. Original magnification, 
100. ¢, A significant increase in neural differentiation is observed, as 
detected by staining embryoid bodies (plated on matrigel) for astrocytes 
(GFAP in green) and neurons (Tujl in red), in 12 day Chd1 RNAi embryoid 
bodies, relative to controls. Nuclei were stained with DAPI. Original 
magnification, X200. 


showed that Chd1 binding strongly correlates with that of Pol II and 
H3K4me3 (Fig. 3 and Supplementary Fig. 8). Bivalent domains, 
simultaneously enriched for both the activating H3K4me3 mark 
and repressive H3K27me3 mark’, are largely devoid of Chd1 
(Fig. 3a and Supplementary Fig. 8). Interestingly, GO analysis indi- 
cated that the strongest Chd1 and Pol II targets are enriched for roles 
in DNA binding, translation and chromatin assembly genes, and that 
this enrichment is not strictly correlated with expression levels 
(Fig. 3b and Supplementary Data 2). Chd1 binding also correlates 
with H3K4me3 enrichment after differentiation: during embryoid 
body formation, the levels of both H3K4me3 and Chd1 are decreased 
at the Oct4 promoter and increased at the endodermal regulator 
Gata4 promoter (Supplementary Fig. 9). These data indicate that 
Chd1 associates globally with euchromatin in ES cells, and may 
preferentially target genes with roles in chromatin organization 
and transcription. 


Chd1 is required for maintenance of open chromatin 


To investigate the effects of Chdl downregulation on ES cell chro- 
matin, we performed immunofluorescence for histone marks of 
euchromatin and heterochromatin. Surprisingly, foci of heterochro- 
matin marks such as H3K9me3 and HP1y (also known as Cbx3), which 
normally appear as dispersed foci in ES cells*, were markedly increased 
in Chd1 RNAiES cells (Fig. 4a and Supplementary Fig. 10). No obvious 
differences were observed in staining for H3K4me3 or H3K27me3 
between Chd1 RNAi ES cells and controls (data not shown). As 
described earlier, Chd1 RNAi ES cells are prone to spontaneous neural 
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Figure 3 | Chd1 associates with euchromatic promoter regions in ES cells. 
a, K-means clustering of Chd1, H3K4me3, H3K27me3 and RNA Pol II 
binding in ES cells. Chd1 binding correlates with binding of H3K4me3 and 
RNA PolII but is excluded from bivalent domains (cluster IV) in ES cells. 
Each row represents the binding pattern along the —5.5 kilobase (kb) to 
+2.5 kb promoter region relative to the transcription start site (TSS). 

b, Functional categorization of Chd1 targets. Gene Ontology (GO) terms 
associated with the 200 genes most strongly bound by Chd1 or RNA Pol II, 
or enriched for H3K4me3, as well as the top 200 genes in expression level in 
ES cells. Categories above an enrichment score of three (x axis) are 
considered significantly enriched. Sixty-nine genes overlap between the 
Chd1 and the RNA Pol II top 200 gene lists, versus 27 for Chd1 and 
expression, and 13 for Chd1 and H3K4me3. 


differentiation, and it has been shown that ES cell-derived neural pre- 
cursors accumulate heterochromatin’. It was therefore important to 
evaluate whether the accumulation of heterochromatin is a con- 
sequence of commitment to the neural lineage, or whether it is present 
in ES-like cells before differentiation. Co-staining with H3K9me3 and 
Oct4 revealed that Oct4-positive ES-like cells, located in the centre of 
compact colonies that stain for other markers of the undifferentiated 
state (Fig. le), have accumulated high levels of heterochromatin in 
Chd1 RNAi cells (Fig. 4a, quantified in 4b). Moreover, we analysed 
the global chromatin dynamics of Chd1 RNAi cells by fluorescence 
recovery after photobleaching (FRAP) using a GFP-tagged histone 
H1. H1 is a linker protein involved in condensing nucleosomes that 
has been shown to rapidly exchange in the hyperdynamic chromatin of 
undifferentiated ES cells’. H1 showed a significant decrease in recovery 
in heterochromatin of Chd1 RNAi ES cells, indicating that the rapid 
exchange of H1 is compromised (Supplementary Fig. 11). These results 
indicate that, despite a global maintenance of the transcriptome, 
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Figure 4 | Chd1 is required to maintain open chromatin in ES cells. 

a, Analysis of H3K9me3 staining by immunofluorescence. Co-staining for 
H3K9me3 and Oct4 distinguishes between ES-like cells (Oct4-positive) and 
differentiating cells (Oct4-negative, white arrowheads). Oct4-positive Chd1 
RNAi ES-like cells have increased heterochromatin foci. Scale bar, 10 um 
b, Quantification of the increase of heterochromatin foci per nucleus in 
Chd1 RNAi ES-like cells, as seen by H3K9me3 staining in Oct4-positive cells; 
P<0.0005. 


morphology, and marker gene expression of ES cells, Chdl RNAi 
ES-like cells are not fully undifferentiated: their chromatin is con- 
densed. 

Heterochromatin formation is induced by methylation of H3K9 
by the enzymes ESET (also known as Setdb1l), Suv39H1/2, G9a 
(Ehmt2) or Glp (Ehmt1)'*, and reversed by the action of H3K9 
demethylases such as Jmjdla (Kdm3a) and Jmjd2c (Kdm4c). 
Jmjdla and Jmjd2c have been shown to regulate genes expressed in 
ES cells and to repress differentiation’’. All of these H3K9 methyl- 
transferases and demethylases are expressed in Chd1 RNAi ES cells at 
similar levels to control ES cells (Supplementary Data 1). Therefore, 
the accumulation of heterochromatin in Chd1 RNAi ES cells is not 
likely to be due to the differential expression of known H3K9 methyl- 
transferases or demethylases. These results indicate that the capacity 
to induce heterochromatin formation exists in undifferentiated ES 
cells, despite the presence of H3K9 demethylases, and that heterochro- 
matinization is countered, directly or indirectly, by Chd1l. Our data 
suggest that ES cells exist in a dynamic state of opposing epigenetic 
influences of euchromatin and heterochromatin, and that the 
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euchromatin protein Chd1 is required to maintain the heterochroma- 
tin-poor pluripotent stem cell state. 


Chd1 is required for efficient induction of pluripotency 

Given the role of Chd1 in maintaining pluripotency of ES cells, we 
hypothesized that it may also be involved in the re-acquisition of 
pluripotency during somatic cell reprogramming. We analysed the 
effect of Chd1 downregulation (Fig. 5a, b) in the reprogramming of 
Oct4—GFP mouse embryonic fibroblasts (MEFs) to induced pluri- 
potent stem (iPS) cells****. Downregulation of Chd1, using two 
independent shRNAs in three separate experiments, led to a signifi- 
cant reduction in the number of iPS cell colonies, scored both by 
morphology and GFP expression. This was not due to a delay in 
colony formation, as colony counts at later time points showed the 
same relative reduction in reprogramming efficiency after Chd1 
RNAi treatment (data not shown). The iPS cell colonies that did 
form in Chd1 RNAi wells either had not been infected by the 
RNAi virus or had silenced it, as assessed both by Chdl qRT-PCR 
and mCherry fluorescence (Supplementary Figs 12 and 13). 
Downregulation of Chd1 could potentially affect proliferation of 
MEFs, which would confound the calculation of reprogramming 
efficiency. However, no significant changes in MEF growth rates were 
found between control and Chd1 RNAi (Fig. 5c). Moreover, Chd1 
RNAi does not alter the expression level of the exogenous reprogram- 
ming factors (data not shown). In summary, our data show that 
downregulation of Chd1 does not affect the expansion of fibroblasts 
but inhibits their reprogramming by induction of pluripotency. 
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Figure 5 | Chd1 is required for efficient induction of pluripotency. a, The 
percentage of reprogrammed colonies was scored both by morphology and 
GFP expression, and normalized to the total number of colonies obtained in 
the control (four factors only, ‘4f). The values are represented as 

mean and s.d. of the averages of three independent experiments, each one 
done in duplicate or triplicate. The total number of colonies in control wells 
(4f or 4f + empty) in the three separate experiments varied between 200 and 
500 per 6-well. The efficiency of induction of pluripotency is significantly 
reduced after Chd1 RNAi treatment. Unpaired t-test was performed using 
the total number of colonies obtained, comparing the control (empty) with 
RNAi against Chd1 (Chd1 RNAil and Chd1 RNAi4). *P < 0.0001, 

**P < 0.00001. b, Chd1 downregulation after RNAi was confirmed by 
qRT-PCR. The values are represented as mean of absolute 

expression and s.d. (n = 3). ¢, Mean growth rate of MEFs measured by the 
MTT assay and s.d. (n = 6). No differences in MEF growth rate were 
observed after Chd1 RNAi treatment. 
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Discussion 


We show here that Chd1 is required for open chromatin and pluripo- 
tency of ES cells. ES cells have been reported to be ‘poised’ for differ- 
entiation by the presence of bivalent domains (marked by H3K4me3 
and H3K27me3) in developmental regulatory genes'’. We speculate 
that the opposing influences of euchromatin and heterochromatin 
(marked by H3K9me3) may be a further mechanism for maintaining 
ES cells in a state poised for differentiation. Chd1 is also highly 
expressed in human ES cells relative to differentiated cells (Supplemen- 
tary Fig. 14)*°, suggesting that its role in pluripotent stem cells may be 
conserved. Interestingly, it is possible that other stem cells maintain 
their differentiation potential using a similar mechanism, because 
Chd1 has also been identified as a gene upregulated in adult haemato- 
poietic and neural stem/progenitor cells”*. Furthermore, our data show 
that Chd1 is required for efficient generation of iPS cells. Fibroblasts 
have much higher levels of heterochromatin than pluripotent stem 
cells’ (and data not shown), and therefore a global opening of the 
chromatin is expected to be a component of reprogramming. Our data 
suggest that Chd1 may contribute to opening the chromatin and enab- 
ling transcription-factor-mediated reprogramming to occur, although 
the precise mechanisms remain to be determined. 

Itis unclear how Chd1, a protein associated with euchromatin, acts 
to counter heterochromatinization. Recent genetic studies in yeast 
indicate that euchromatin-associated factors prevent spreading of 
heterochromatin to euchromatic regions*”’*. Further studies will 
be required to determine whether Chd1 has a similar role in ES cells. 
An intriguing possibility is that Chd1 may mediate incorporation of 
the histone variant H3.3, which is generally associated with active 
genes and is less prone to H3K9 methylation”’. In support of this 
model, Chdl has recently been shown to be required in the 
Drosophila oocyte for incorporation of H3.3 into sperm chromatin, 
a step necessary for development”. In addition, a genome-wide 
analysis of the chromatin state of Chd1 RNAi ES cells may provide 
insight into the differential sensitivity of endoderm and neural 
lineages to chromatin condensation. 


METHODS SUMMARY 


The RNAi screen was performed in Oct4-GiP ES cells using the lentiviral vector 
pSicoR-mCherry. A proliferation index was derived from a competition assay 
between infected and non-infected cells analysed by flow cytometry over five 
passages. Oct4-GiP and E14 Chd1 RNAiand control ES cell clones were expanded 
from mCherry* cells isolated by FACS, and validated by qRT-PCR for Chd1 
downregulation. ES clones and embryoid bodies were grown in standard culture 
conditions. For expression microarray experiments, RNA from E14 Chd1 RNAi 
and control clones was amplified and hybridized to Affymetrix Mouse Gene 1.0 
ST arrays. Normalized data was analysed in dChip. Gene Ontology analyses were 
done with MAPPFinder or DAVID. For ChIP-chip, immunoprecipitated DNA 
was amplified and hybridized to Agilent G4490 promoter arrays. Data were 
extracted as previously described*' and visualized using Cluster 3.0 and 
Treeview. ChIP, qRT-PCR and western blotting were performed using standard 
protocols. Immunofluorescence was performed on matrigel-coated chamber 
slides, with ES cells or embryoid bodies plated 2 days before fixation, or on 
paraffin-sectioned embryoid bodies. Teratomas were generated by subcutaneous 
injection of ES cells into SCID mice. FRAP analysis’ and induction of pluripo- 
tency™ were performed as described previously. Cell proliferation was measured 
using the MTT assay. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

ES cell culture and differentiation. Mouse E14 and Oct4-GiP ES cells’? were 
plated on 0.1% gelatin-coated plates or on a feeder layer of irradiated MEFs, and 
maintained in DMEM (Invitrogen) supplemented with 15% knockout serum 
replacement (Invitrogen), 1 mM 1-glutamine, 0.1 mM nonessential amino acids, 
100u1gml~! penicillin, 100,1gml~’ streptomycin, 1mM_ sodium pyruvate, 
0.1 mM 2-mercaptoethanol and recombinant LIF. Mouse Gataé /~ ES cells'® were 
grown in identical conditions except that fetal bovine serum was used instead of 
knockout serum replacement. Embryoid bodies were formed by plating ES cells in 
non-attachment conditions (suspension culture) with ES cell medium with fetal 
bovine serum and in the absence of LIF, for 16 days. Contractile foci were counted 
under an inverted microscope using triplicates of 10-cm dishes per ES cell clone. 
RNAi and competition assay. The genes tested in the RNAi screen were the 
following: Ap2gamma (also known as Tcfap2c), Brcal, Cebpz, Chdl, Ddx18, 
Dmrt1, Dppa2, Dppa3 (Stella), Dppa4, Eed, Foxd3, Hells, Mybl2, c-Myc, Mycbp, 
Nmycl (Mycn), Nanog, Nfya, Nfyb, NrOb1, Nr5a2, Oct4, Pramel4, Pramel5, Rex1 
(Zfp42), Rex2, Rbm35a (Esrp1), Sall4, Six4, Sox2, Suz12, Tcfcp2l1, Terfl, Tex292 
(Cirhla), Utfl, Zic3, A030007L17Rik (Ggc2), and Affymetrix MG_U74Av2 
probe sets 160906_i_at, 135189 _f at, 97154_f_at and 98524 _f at. shRNA 
sequences were selected according to published criteria’: GFP RNAi, 
ACAGCCACAACGTCTATAT; Oct4 RNAi, GAACCTGGCTAAGCTTCCA; 
Chdl RNAil, ACATTATGATGGAGCTAAA; Chdl RNAi4, GTGCTACT 
ACAACCATTTA. All other sequences are in Supplementary Table 1. 
Oligonucleotides coding for the shRNAs were designed and cloned into the 
lentiviral vector pSicoR-mCherry as described’. Lentiviruses were produced as 
described®. For transduction, 10° ES cells were incubated with virus in 1 ml of ES 
cell medium (multiplicity of infection 5-10). After 1h rotating at 37°C, 2.5— 
3 X 10° cells were plated per gelatinized well of a 12-well plate. A competition 
assay** was performed by analysing cells that were passaged every two or three 
days. Flow cytometry was performed on a LSRII and analysed using the Flojo 
software. Proliferation index was measured, for every passage, by dividing the 
percentage of mCherry* (shRNA) with the percentage of mCherry* (empty 
virus). Loss of Oct4d—GFP activity was measured by dividing the percentage of 
GFP cells (shRNA) with the percentage of GFP (empty virus). The calculation 
of the loss of Oct4/GFP expression was done with total GFP cells rather than 
just GFP" /mCherry~ to account for potential silencing of the mCherry con- 
struct after differentiation or non-cell autonomous effects. Proliferation index 
data are averages of triplicates (n = 3) and standard error bars. mCherry’ ES 
cells were isolated using a FACSDiVa (BD Biosciences) cell sorter. 

Colony formation assay and clonal derivation. E14 and Oct4-GiP ES cells were 
infected with lentiviruses containing shRNAs or empty virus alone, as described 
above. mCherry™ cells were sorted on day 5 after infection using a FACSDiVa 
(BD Biosciences) cell sorter. Five thousand cells were plated per 10-cm dish in 
triplicates. After 10 days in culture, individual clones were picked per each con- 
dition (empty virus, GFP RNAi, Chd1 RNAil or RNAi4) and propagated in 
standard ES cell growth conditions. Plates were stained for alkaline phosphatase 
using a Vector kit and colonies were counted. Results are averages of triplicates 
and standard error bars. 

Expression microarrays. Uninfected parental E14 ES cells, one clone infected 
with empty pSicoR-mCherry, one GFP RNAi clone and four Chd1 RNAi clones, 
three from Chd1 RNAiIl (C1i5, C1i6 and C1i9) and one from Chd1 RNAi4 (C4i2) 
were grown on gelatin in ES cell culture medium. Total RNA was isolated using 
the RNeasy kit (Qiagen) with in-column DNase digestion. Three-hundred nano- 
grams of total RNA per sample were amplified and hybridized to Affymetrix 
Mouse Gene 1.0 ST arrays according to the manufacturer’s instructions at the 
Genomics Core Facility of the Gladstone Institutes. These arrays assay for the 
expression of about 35,500 transcripts. Data were normalized using robust 
multi-array normalization. Hierarchical clustering and calculations of differ- 
ential gene expression were done using dChip (http://www.dchip.org)**. The 
full normalized data are in Supplementary Data 1. The lower bound of the 
90% confidence interval of the fold change (LCB) was used as a conservative 
estimate of the fold change. Five-hundred-and-thirty-one transcripts with 
LCB > 2 in Chd1 RNAi relative to controls were analysed in MAPPFinder’® 
for enrichment of gene ontology terms. Terms with Pvalues adjusted for mul- 
tiple testing = 0.01 were considered enriched. 

Immunohistochemistry. ES cells or embryoid bodies were plated on chamber 
glass slides pre-coated with matrigel. ES cells were plated on a layer of irradiated 
MEFs. After 2 days, cells were fixed with 4% paraformaldehyde, permeabilized 
with PBT (PBS plus 0.1% Triton X-100) and blocked with 2% BSA plus 1% goat 
or donkey serum in PBT. Slides were immunostained with primary antibody in 
blocking solution. Alternatively, embryoid bodies in suspension (at day 6) were 
fixed, paraffin-embedded, sectioned (50 jim), and stained for haematoxylin and 
eosin or immunostained. 
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Primary antibodies used: SSEAI (MC-480, Developmental Studies 
Hybridoma Bank (DSHB), 1:200), Oct4 (sc5279, Santa Cruz, 1:100; sc9081; 
Santa Cruz; 1:50), nestin (MAB353, Chemicon, 1:200), BLBP (ab32423, 
Abcam, 1:200), Afp (sc8977, Santa Cruz. 1:200), Gata4 (scl237, Santa Cruz, 
1:50), Tujl (MMS-435P, Covance, 1:250), GFAP (Z0334, Dako, 1:500), 
H3K4me3 (ab8580, Abcam, 1:200), H3K9me3 (07-449, Upstate, 1:100; 
ab8898, Abcam, 1:100), H3K27me3 (07-449, Upstate, 1:100), HPlgamma 
(MAB3450, Chemicon, 1:500). Secondary antibodies: Alexa Fluor 488/594 con- 
jugated secondary antibodies (anti-mouse, anti-rabbit, or anti-goat, 1:500, 
Molecular Probes). Nuclei were counterstained with DAPI. The MC-480 anti- 
body developed by D. Solter was obtained from the DSHB developed under the 
auspices of the NICHD and maintained by University of Iowa. 
qRT-PCR. RNA was isolated according to the RNeasy kit (Qiagen), and reverse- 
transcribed using the iScript first strand cDNAsynthesis kit (BioRad). The cDNA 
reaction was diluted 1:5 in TE buffer and used in Sybr Green real-time PCR 
reactions (BioRad). Housekeeping genes used were ubiquitin-b and ribosomal 
protein L7. PCR primer sequences are available on request. Reactions were run in 
replicates on a MyiQ qPCR machine (BioRad) according to the manufacturer’s 
instructions. Cycle threshold values were imported into the REST software’’ for 
fold-change calculations, using the housekeeping genes as controls. Values are 
presented in log, scale or in absolute expression levels compared with parental 
E14 RNA unless otherwise indicated. 

SSEA1 cell sorting. ES cells (GFP RNAi control and Chd1 RNAi clones) were 
collected by trypsinization, washed in ice-cold PBS, first resuspended in staining 
medium (HBSS, Ca/Mg-free, no phenol red, 2% FBS) with primary mouse 
antibody anti-SSEA1 (MC-480, DSHB, 1:50) for 30min on ice, and then in 
secondary anti-mouse IgM-PE (406507, BioLegends, 1:100) for 30 min on ice. 
Propidium iodide (P3566; Invitrogen) was added and live SSEAI* and SSEA1~ 
cells were isolated using a FACSDiVa (BD Biosciences) cell sorter. 

Western blotting. Whole cell extracts were prepared and measured using the 
Bradford assay (BioRad) for protein content. From whole ES cell extracts 30 pg 
of protein were resolved on SDS-PAGE gel (10%) using a rabbit antibody against 
Chd1 (1:2,000, from R. Perry'?) and a goat anti-rabbit HRP (1:10,000). The 
loading control used was o-tubulin, detected with a mouse antibody (1:1,000, 
Sigma T9026). From embryoid body extracts (at day 12), 10 tg of protein were 
resolved on a 10% SDS-PAGE gel using an antibody against Tujl (1:1,000) and 
anti-goat HRP (1:10,000). Detection was performed using the ECL kit according 
to manufacturer’s instructions (Amersham). 

FRAP analysis. Transfection of HI-GFP into ES cells and FRAP analysis were 
performed as described’. 

Generation of teratomas. Teratomas were produced by injecting 3 X 10° cells 
subcutaneously in the flanks of SCID mice. Tumour tissue samples developed in 
12 weeks and were fixed overnight in 4% paraformaldehyde before paraffin 
embedding. Sections were stained with haematoxylin and eosin with a standard 
protocol. 

ChIP. ChIP was performed essentially as described by Upstate Biotechnology, with 
some minor changes described below: chromatin was cross-linked by incubating 
cells on plates with PBS containing 20 mM dimethylpimelimidate (DMP; Sigma) 
and 0.25% dimethylsulphoxide (DMSO) for 1 h at room temperature. Cells were 
re-fixed with 2% paraformaldehyde for another hour at room temperature, 
scraped and centrifuged at 1,350¢ for 5 min. Pellets were resuspended in SDS lysis 
buffer and sonicated to obtain fragments of ~200-1,000 base pairs (bp) as verified 
on a gel. Reactions were centrifuged at 13,000g for 10 min and the supernatants 
were used. Antibodies used (3 ttg each): Chd1 (PAB-10569, Orbigen), H3K4me3 
(ab8580, Abcam) IgG (ab46540, Abcam). DNA was purified by phenol-chloro- 
form extraction, followed by ethanol precipitation. DNA concentration was deter- 
mined using a Nanodrop spectrophotometer (NanoDrop Technologies) and 
5-10 ng were used in Sybr Green real-time PCR reactions (see earlier) ran in 
duplicates or triplicates. Primer sequences are available on request. Fold enrich- 
ment over input was calculated using the 24¢ method corrected with IgG C, values. 
The HoxA3 primer set was used asa control gene because it corresponds toa region 
known to lack H3K4me3 (ref. 17). 

ChIP-chip. ChIP and hybridization onto Agilent promoter microarrays was 
performed as described”. In brief 500 j1g of crosslinked ES chromatin was immuno- 
precipitated with 10 ig of Chd1 antibody (Allele Biotech PAB-10568) or hypophos- 
phorylated RNA polymerase II (8WG16). The eluate was reverse crosslinked, 
RNase- and proteinase-K-treated and purified. Equal amounts of input and immu- 
noprecipitated samples were amplified using the WGA2 kit (Sigma), labelled with 
the Bioprime kit (Invitrogen) and hybridized onto Agilent mouse promoter arrays 
(G4490) according to manufacturer’s instructions. Data were extracted as previously 
described’', Data were visualized using the Cluster 3.0 and Treeview programs. 
Bound genes were determined using the Young laboratory algorithm**. H3K4me3 
and H3K27me3 data, as well as the algorithm to generate the 500-bp window 
presentation were previously published”. In K-means clustering, each row 
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represents the binding pattern along the —5.5 to +2.5 kb promoter region relative to 
the TSS, reiterated four times to present the data for each immunoprecipitation. The 
8kb promoter region is divided into sixteen 500bp fragments that display the 
average log ratio of probe signal intensity with blue, yellow and grey representing 
lower-than-average, higher-than-average and missing values for enrichment due to 
lack of probes in those regions, respectively. The odds ratio for binary correlation of 
Chd1 binding strength was calculated as the ratio of the probability of a gene being 
bound by Chd1 divided by the probability of it being bound by H3K4me3 (or RNA 
Pol II or H3K27me3) to the probability of a gene being bound by Chd1 divided by 
the probability of the gene being unbound by H3K4me3 (or RNA PollII or 
H3K27me3). 

Reprogramming. Reprogramming was performed as previously described”, 
with minor changes: Oct4—GFP MEFs were reprogrammed using lentiviral infec- 
tion (day 0) of four transcription factors (Oct4, Sox2, nMyc and KIf4). RNAi 
lentiviral vectors (empty, Chdl RNAil, Chd1 RNAi4) were used to infect 
MEFs 4 days before the addition of the four factors. At day 1, MEFs were also 
plated fora MTT assay to quantify growth rates and for RNA collection for gRT- 
PCR for Chd1. An optimization of the protocol was also used and described in 
Supplementary Fig. 12 (as reported previously”). iPS colonies were scored (13 to 
16 days after addition of the four factors) by GFP fluorescence, using a scale 
according to the number of cells in a colony that were GFP positive, as described 
in Supplementary Fig. 12b (GFP-positive refers to all the colonies with any GFP- 
positive cells), or by their morphology under bright field (GFP-negative). 
MTT assay. The growth rate of MEFs was measured using an indirect method. 
Yellow MTT is reduced by mitochondrial enzymes into a purple formazan and 
the absorbance measured as a result of the number of viable cells. For the MTT 
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5min at room temperature. Absorbance was recorded at 540nm using a 
Spectramax M2 microplate reader (Molecular Devices). 
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The diversity of type la supernovae from 


broken symmetries 


D. Kasen’, F. K. R6épke? & S. E. Woosley’ 


Type Ia supernovae result when carbon-oxygen white dwarfs in 
binary systems accrete mass from companion stars, reach a critical 
mass and explode. The near uniformity of their light curves makes 
these supernovae good ‘standard candles’ for measuring cosmic 
expansion’ ~, but a correction must be applied to account for the fact 
that the brighter ones have broader light curves’. One-dimensional 
modelling, with a certain choice of parameters, can reproduce this 
general trend in the width—-luminosity relation®*; but the processes 
of ignition and detonation have recently been shown to be intrin- 
sically asymmetric” "*, so parameterization must have its limits. Here 
we report multi-dimensional modelling of the explosion physics and 
radiative transfer, which reveals that the breaking of spherical 
symmetry is a critical factor in determining both the width- 
luminosity relation and the observed scatter about it. The deviation 
from spherical symmetry can also explain the finite polarization 
detected in the light from some supernovae". The slope and normali- 
zation of the width—luminosity relation has a weak dependence on 
certain properties of the white dwarf progenitor, in particular the 
trace abundances of elements other than carbon and oxygen. Failing 
to correct for this effect could lead to systematic overestimates of up 
to 2 per cent in the distance to remote supernovae. 

In the most established model for type Ia supernovae, a carbon 
fusion flame is ignited near the centre of the white dwarf and initially 
burns in a sub-sonic and turbulent deflagration, then transitions near 
the white dwarf surface to a supersonic detonation’*'®. The detona- 
tion is needed in order to match the observed energetics and nucleo- 
synthesis’’. The energetic stage of the explosion lasts only a second or 
so, but synthesizes radioactive °°Ni that powers the subsequent light 
curve and determines its luminosity. In previous one-dimensional 
models, the *°Ni yield depended on the choices of parameters 
representing the speed of the subsonic burning front and the density 
at which the front makes a transition to a detonation wave'*. 
However, these parameters are actually a consequence of multi- 
dimensional instabilities not captured in one dimension, and so were 
not highly constrained by physics. Ignition in the one-dimensional 
models also occurred at the very centre of the star, and the transition 
to detonation happened simultaneously on a symmetric spherical 
shell. Both of these assumptions are now doubted. 

The starting point of our simulations is a standard 1.4-solar-mass 
white dwarf composed of 50% carbon and 50% oxygen with a central 
density of 2.9 10°gcm™*. The propagation of the burning is 
followed in two spatial dimensions using a level set to track the 
flame’s location and a turbulent subgrid model adopted from the 
chemical combustion community to describe its motion'?. The 
resulting debris are then post-processed using a multi-dimensional 
radiative transport code” to determine the emergent radiation. 
Model variations consist of how the white dwarf is ignited and the 
criteria for making the transition to detonation, both of which are 


based on insights from recent three-dimensional studies, as described 
below. Variations in the trace abundance of elements other than 
carbon/oxygen (the metallicity) of the progenitor star over a range 
from one-third to three times solar are also explored, to mimic the 
evolution in supernova environment that may have occurred over 
aeons of cosmic time. 

Simulations show®”® that just before ignition the white dwarf 
undergoes a ‘simmer phase’ characterized by dipolar convection, with 
material flowing out of the centre in a directed plume then circulating 
back in through the opposite side. The hottest points in the flow, and 
therefore where ignition occurs, are found on one side of the star 
displaced slightly from the centre. As the simulations have been 
computed at much lower Reynolds number (~ 1,000) than the true 
star (~10'*), the real flow may be even less ordered, retaining a dipole 
flavour but contaminated by higher multipoles, making ignition a 
much more chaotic process. The geometry of ignition may also be 
influenced by the rotation rate of the star. In our models, we therefore 
varied the number of ignition points, their distance from the centre of 
the star, and the solid angle in which they were distributed. 

The physics of a putative transition to detonation, though still 
uncertain, has also been elucidated in recent studies*)**. As the 
deflagration flame propagates into lower-density material, it thickens 
and slows to the point that turbulent eddies can mix hot ‘fuel’ and 
cold ‘ash’, causing the burning rate to become highly irregular and 
potentially explosive. Like ignition, detonation should be a stochastic 
process which occurs only for the most extreme and intermittent 
turbulent energies, and may occur many times in different places. 
We therefore varied the criterion (the critical Karlovitz number) for 
instituting a detonation (Supplementary Information). 

Figure 1 shows the final chemical structure (100s after ignition) of 
the stellar debris for a sample of models. The inner regions of material, 
which were burned primarily in the turbulent, Rayleigh-Taylor 
unstable deflagration phase, consist of a patchy mixture of *°Ni, 
intermediate-mass elements, and stable iron-group isotopes pro- 
duced by electron capture (mostly **Fe and **Ni). The subsequent 
detonation produced a smoother distribution of intermediate-mass 
elements in the outer layers of the star. The synthetic model light 
curves, colours, and spectral time series agree very well with those of 
real events observed over the two months after explosion” (Fig. 2), 
offering a strong validation of the model’s predicted velocity structure 
and chemical stratification. On the other hand, it has been 
suggested that the mixing of electron-capture elements throughout 
the inner layers—a feature generic to all multi-dimensional deflag- 
ration calculations—may be inconsistent with spectral observations of 
some type Ia supernovae taken at late phases”. 

The models predict a range of *°Ni yields, from 0.3 to 1.1 
solar masses, depending on the initial conditions. Perhaps counter- 
intuitively, models with more robust ignition generally synthesize 
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less”°Ni. A strong ignition (that is, numerous, symmetrically distrib- 


uted ‘sparks’) increases the amount of burning in the deflagration 
phase during which the star expands and the density declines. This 
reduces the *°Ni produced in the subsequent detonation wave. A 
weak or asymmetrical ignition, on the other hand, gives little deflag- 
ration burning and minimal pre-expansion, so that the detonation 
synthesizes abundant °°Ni. For similar reasons, the *°Ni mass also 
depended on the detonation criteria, as noted in previous one- 
dimensional calculations”. 

Given the range in *°Ni masses, the model peak luminosities vary 
by a factor of three, (0.7~2.1) X 10* ergs” '; this spans the range of 
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Figure 1| Chemical structure of the 
ejected debris 100s after ignition 
for a subset of the explosion 
models with different ignition and 
detonation conditions. Blue, 
intermediate-mass elements (that 
is, silicon, sulphur, calcium); green, 
stable iron-group elements 
produced by electron capture; and 
red, °°Ni. The *°Ni production 
increases (left to right) for models 
which undergo relatively more 
burning in the detonation as 
compared to the deflagration phase 
of the explosion. The turbulent 
inner regions reflect 
Rayleigh-Taylor and other 
instabilities that develop during the 
initial deflagration phase of 
burning. The subsequent 
detonation wave enhances the *°Ni 
production in the centre by burning 
remaining pockets of fuel. The 
lower-density outer layers of debris, 
processed only by the detonation, 
consist of smoothly distributed 
intermediate-mass elements. 


normal type Ia supernovae, though does not reproduce the more 
extreme and peculiar sub- and super-luminous events”®. A substantial 
amount of the dispersion in luminosities of type Ia supernovae may 
therefore be the result of the stochastic and asymmetrical nature of the 
explosion itself, apart from any variations in the properties of the 
progenitor star. The r.m.s. dispersion in brightness depends on the 
wavelength band considered, being 45% in the blue, 27% in the red, 
and only 21% in the near-infrared (~1.25 um wavelength). This 
reflects a self-regulating property of the radiative transfer—dimmer 
models are relatively cooler, and so radiate a greater percentage of their 
flux at longer wavelengths. The models thus confirm observational 


Figure 2 | Synthetic multi-colour light curves 
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Figure 3 | Correlation of the peak brightness of the models with their light- 
curve duration and colour. The sample includes 44 models each plotted for 
30 different viewing angles with equal probability of being observed. Solid 
circles denote models computed with the most likely range of detonation 
criteria, while open circles denote more extreme values. a, Relation between 
the peak brightness Mz (measured in the logarithmic magnitude scale) and 
the light-curve decline rate parameter AMj;, defined as the decrease in 
B-band brightness from peak to 15 days after peak. The shaded band shows 
the approximate slope and spread of the observed width-luminosity relation 
(WLR). b, Relation between Mg and the colour parameter B — V measured at 
peak. The solid line shows the slope of the observed relation of ref. 6 but with 
the normalization shifted, as the models are systematically redder than 


indications that type Ia supernovae are nearly standard (as opposed to 
merely standardizable) candles in the near-infrared”. 

Without introducing any artificial tuning, the luminosity of the 
models correlates with the light-curve decline rate, giving a width— 
luminosity relation (WLR) similar to that observed (Fig. 3). A cor- 
relation is also found between brightness and the colour measured at 
peak. The r.m.s. scatter in the model WLR alone is 24%, while using 
both decline rate and colour reduces the dispersion to 21%—similar 
to, but slightly greater than, the ~16% that is observed”*. The larger 
diversity seen in the models suggests that additional important 
physics may constrain the ignition and detonation conditions to a 
range narrower than that considered here. As in observations, the 
calibration of the models can be improved by using additional 
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Figure 4 | Effect of the metal content of the progenitor star population on 
the WLR. The models explore two extreme values of the metallicity: 3 times 
(red points) and 0.3 times the solar value (blue points). For clarity, each 
model has been averaged over all viewing angles, and black lines connect 
similar explosion models of differing metallicity. The coloured lines are 
linear fits to the WLR of the two metallicity samples separately. The diversity 
introduced by metallicity variations follows the general width-luminosity 
trend, but the slightly different normalization and slope of the relation for 
different metallicity samples indicates a potential source of systematic error 
in distance determinations. 
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observed type Ia supernovae by ~7%, probably due to the approximate 
treatment of non-local thermodynamic equilibrium effects. In observational 
studies, these two relations are usually fitted jointly as: 

Mg = Mg + «(s — 1) — BI(B — V) pmax + (B — V)o], where s is a stretch 
parameter, (B — V) pmax is the colour measured at the light curve peak, and 
Mg, is the fitted B-band magnitude of a s = 1 supernova. We take the colour 
of a fiducial supernova, (B — V)o = 0, and determine stretch using the first- 
order relation s = 1 — (AM,; — 1.1)/1.7. We find for the models fitted values 
of « = 2.25, B = 4.45 and Mgo 19.27, which are in reasonable agreement 
with those derived from the recent observational sample of ref. 3: « = 1.52, 
fb = 1.57 and Mg = —19.31 + 5logio(H,/70), where H, is the Hubble 
parameter. 


information from the light curve. For example, including the shape 
of the light curve in several optical and near-infrared wavelength 
bands reduces the scatter to only 15%. 

For a given mass of *°Ni, the residual scatter in the model WLR 
reflects individual features of the supernova debris structure. The 
turbulent deflagration phase imprints density and chemical inhomo- 
geneities, which lead to variations in the timescale for photons to 
diffuse out of the debris. In addition, the global asymmetry—due to 
asymmetric ignition conditions or off-centre detonation points— 
gives rise to anisotropic emission, so that the brightness and duration 
of most models vary by 20-30%, depending of the angle from which 
they are viewed. Although the adoption of a two-dimensional geo- 
metry may exaggerate global asymmetries, spectropolarization 
observations reveal that type Ia supernovae typically possess aspheri- 
city near or just below the level predicted here’*. Dimmer supernovae 
tend to be more polarized, an observation consistent with our finding 
that dimmer models are more asymmetric owing to relatively more 
burning in the turbulent deflagration phase. 

There are both theoretical suggestions” and observational indica- 
tions® that the metallicity of the white dwarf will affect the *°Ni yield at 
the ~10% level. This is because the extra neutrons in trace elements 
such as 7’Ne lead to an increased synthesis of neutronized iron group 
elements (**Fe and *8Ni) at the expense of S°Ni. To test the first-order 
effect of metallicity on the light curves, we varied the *°Ni and metal 
abundances in the ejecta models according to predicted nucleosyn- 
thetic results. The resulting light curves (Fig. 4) show that both the 
peak luminosity and light-curve duration decline with metallicity, in 
a manner roughly consistent with the WLR. Application of the WLR 
should therefore partially correct for metallicity variations, but with a 
residual error due to the different slope and normalization of the WLR 
at different metallicity. For extreme metallicity variations (from 0.3 to 
3.0 times the solar value), this effect can lead to errors in the inferred 
distance to the supernova as large as 4%. The actual metallicity 
evolution over the range probed by cosmology experiments is much 
smaller than this, and we estimate that systematic errors in distance 
will probably be less than 2%. 

The models suggest that a substantial amount of the scatter in the 
observed WLR arises from the random substructures and viewing- 
angle effects that are predicted by simulations of multi-dimensional 
explosions. In cosmological standard candle applications, these 
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translate to easily reducible statistical errors. On the other hand, 
additional diversity arises from variations in metallicity and other 
properties of the progenitor star (for example, carbon/oxygen ratio, 
central density, rotation), which may introduce a source of system- 
atic error. Simulation offers one means to test how such variations 
influence the supernova light curve; for future dark-energy experi- 
ments, this will help to anticipate and limit errors arising from a shift 
in progenitor demographics over cosmic time. 
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Storms in the tropics of Titan 


E. L. Schaller’, H. G. Roe”, T. Schneider** & M. E. Brown? 


Methane clouds, lakes and most fluvial features on Saturn’s moon 
Titan have been observed in the moist high latitudes’*, while the 
tropics have been nearly devoid of convective clouds and have 
shown an abundance of wind-carved surface features like dunes”*. 
The presence of small-scale channels and dry riverbeds near the 
equator observed by the Huygens probe’ at latitudes thought 
incapable of supporting convection’®”’ (and thus strong rain) 
has been suggested to be due to geological seepage or other 
mechanisms not related to precipitation’*. Here we report the 
presence of bright, transient, tropospheric clouds in tropical 
latitudes. We find that the initial pulse of cloud activity generated 
planetary waves that instigated cloud activity at other latitudes 
across Titan that had been cloud-free for at least several years. 
These observations show that convective pulses at one latitude 
can trigger short-term convection at other latitudes, even those 
not generally considered capable of supporting convection, and 
may also explain the presence of methane-carved rivers and 
channels near the Huygens landing site. 

Insolation on Titan varies seasonally during Saturn’s 29.5-year orbit 
because of its 27° obliquity. The frequent presence of tropospheric 
clouds near the south pole at the southern summer solstice’” led to 
the hypothesis that seasonally varying insolation controlled Titan’s 
cloud locations’. Recent observations have shown that Titan’s clouds 
display more complicated behaviour, including clouds at southern 
mid-latitudes'*’* and clouds associated with lakes near the north pole’. 
Titan general circulation models have predicted that the majority of 
convective cloud activity should occur near the summer poles, with 
some cloud activity occurring at mid-latitudes, and very little in the 
tropics'®'*"®. Although there are differences in the precise locations 
and frequencies of clouds in these models, they all predict that the 
equatorial regions should generally be the driest locations on the moon 
throughout most of Titan’s year. In contrast with these predictions, 
images taken during the descent of the Huygens probe through Titan’s 
atmosphere in January 2005 revealed small-scale channels and streams 
that appeared to have been carved by fluids (presumably methane) at 
equatorial (10° S) latitudes'’”. The morphologies of these channels also 
suggested that a high precipitation rate was needed to form them”. 

Unlike the Earth, which is approximately 65% cloud covered 
throughout the year’’, observations of Titan since 1990 have generally 
revealed a small amount of tropospheric cloud activity, covering only 
a small fraction (less than 1%) of Titan’s disk’”°?!. However, on two 
occasions (September 1995 and October 2004; refs 22 and 23) clouds 
were observed to brighten dramatically, covering 5—7% of the surface, 
and, in the case of the October 2004 event, were found to last for at 
least a month. To determine the frequencies and causes of these short- 
lived cloud brightenings, which are thought to be associated with 
significant amounts of methane precipitation™, we developed a 
long-term observing programme with the National Aeronautics and 
Space Administration (NASA) Infrared Telescope Facility (IRTF) to 
determine the percentage of tropospheric cloud coverage of Titan’s 
disk on a frequent (nearly nightly) basis”. 


Although 138 nights of observations spread over 2.2 years revealed 
extremely low levels of cloud activity (0.3% coverage)**, on 13 April 
2008 we observed an increase in flux at surface- or troposphere- 
probing wavelengths of brightness comparable to that of the events 
from 1995 and 2004 (Fig. 1). The following night we triggered adap- 
tive optics observations with the Near Infrared Imager (NIRI) on the 
Gemini North telescope. These infrared images revealed a large cloud 
system in Titan’s troposphere centred at 29° S, 247° W and with a 
latitudinal extent of + 15°. In observations over the next three nights, 
the brightness centroid of this cloud system moved with an apparent 
eastward velocity of 3+ 1ms '. The system also spread out in a 
southeastward direction, and finally rotated completely off the limb 
on 17 April. On 16-17 and 20-23 April, clouds also appeared near the 
south pole (81-60°S), with brightnesses not seen in the previous 
three years’ (Figs 2 and 3). 

The south pole was not predicted to be capable of supporting con- 
vective clouds during the season these clouds were observed (one year 
before vernal equinox)'®'*”*, and clouds of this brightness had not 
been observed since November 2004 despite frequent imaging’ and 
IRTF spectral monitoring”. Thus the appearance of relatively bright 
south polar clouds 3—7 days after the largest increase in Titan cloud 
activity observed in over three years points to a causal relationship. 
The south polar clouds were probably instigated by an atmospheric 
teleconnection mediated by planetary Rossby waves, formed from the 
initial pulse of cloud activity that occurred between 9 and 13 April. 
Rossby waves are large-scale atmospheric waves familiar from the 
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Figure 1| Titan IRTF spectra. Near-infrared spectra of Titan from 28 March 
and 13 April 2008. Both spectra have approximately the same central 
longitude (227° W), but the spectrum from 13 April (red) shows increased 
flux in the transparent regions of Titan’s atmosphere (0.8, 0.95, 1.2, 1.6 and 
2.0 um) while staying the same brightness in the high-opacity regions. 
Gemini images from the following night show the presence of a large 
extended cloud centred at 29° S, 247° W. Large cloud outbursts such as these 
(where Titan’s clouds are seen to increase by over a factor of ten over typical 
levels) have been observed on only two other occasions’. The methane 
rainout from these large storms is thought to be responsible for carving 
Titan’s streams and valley networks'**. 


'Mnstitute for Astronomy, University of Hawaii, Honolulu, Hawaii 96822, USA. “Lowell Observatory, Flagstaff, Arizona 86001, USA. Geological and Planetary Sciences, *Environmental 
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14 April 2008 (251°) 15 April 2008 (273°) 16 April 2008 (296°) 


18 April 2008 (341°) 20 April 2008 (27°) 25 April 2008 (140°) 


a 


28 April 2008 (210°) 1 May 2008 (275°) 


Figure 2 | Gemini adaptive optics images of Titan. Titan sub-Earth 
longitude and Universal Time date are noted for each observation. Images 
were taken through the H2(1-0) (2.1 1m) filter that probes to Titan’s 
troposphere. Clouds were first detected on 13 April 2008 by our whole-disk 
spectroscopic monitoring programme with IRTF (Fig. 1). Clouds were 
subsequently detected in tropical, polar, and temperate latitudes using the 
Gemini adaptive optics system. Images from 28 April 2008 and 1 June 2008 
show a faint cloud persisting over the same location as the northwesternmost 
extent of the initial large cloud from 14 April 2008 (15° S, 250° W; green 
box), perhaps indicating that the initial cloud outburst may have been 
localized here (see also Fig. 3). 


Earth’s atmosphere, which can propagate in the zonal and meridional 
directions. They can instigate cloud activity by modifying the tem- 
perature structure of the atmosphere and by inducing convergent flow 
near the surface. For a wave generated in the tropics to reach 72°S 
latitude in 3-7 days, it needs to travel with a meridional velocity of 
3-8ms_'. Deep (barotropic) Rossby waves with wavenumbers of ~2 


14 April 2008 EN §=—-15 April 2008 ENN 16 April 2008 
21 April 2008 EN = 22 April 2008 ENN 23 April 2008 
1 May 2008 2 May 2008 8 May 2008 


Figure 3 | Titan cloud locations. Cassini ISS surface maps (from 

http://ciclops.org) with locations of clouds from 14 April to 8 May 2008. No 
clouds were observed on 23 April 2008. Clouds that are located poleward of 
55° S latitude are shown on the pole-projected surface map. Both maps show 
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have zonal and meridional group velocities consistent with the 
observed propagation velocities (see Supplementary Information). 
Such long waves are probably the only ones able to propagate out of 
low-latitude generation regions (see Supplementary Information). 

In addition to cloud activity near the south pole, on 18—20 April we 
also observed cloud activity from 20-12°S latitude, which are the 
closest latitudes to the equator we have ever observed cloud activity 
from the ground. Given the near lack of cloud activity at these low 
latitudes in all other observations (even from Cassini*””*), it is likely 
that these clouds were also triggered by the initial outburst. The near- 
equatorial clouds persisted for several days and had extended streak- 
like morphologies, similar to the mid-latitude clouds observed from 
2003 to 2006 (refs 14, 15). 

On 24 and 25 April 2008, the initial cloud rotated back onto the 
observable limb, decreased in brightness from that of the initial out- 
burst but still moving with the same velocity of ~3ms ' as in the 
initial three days of observations (Figs 2 and 3). South polar cloud 
activity was nearly non-existent. However, the image from 28 April 
shows that the same cloud system had grown significantly brighter, 
perhaps caused by the initial wave re-instigating cloud activity down- 
stream. This new pulse of cloud activity, comparable in brightness to 
the original pulse from 14 April, displayed a similar morphology to 
the clouds seen on 15 and 16 April. Similarly, days later, clouds were 
also observed near the south pole and at near-equatorial latitudes. 
These observations demonstrate that a convective pulse at one 
latitude can trigger cloud activity at other latitudes thought to be 
incapable of supporting convective cloud activity. The cloud activity 
observed near the south pole indicates that for this short time period, 
convection was remarkably vigorous. Clouds appeared and re- 
appeared over different longitudes only days apart, indicating the 
likely formation and reformation of these cloud systems (similar to 
what was observed in October 2004). The precise locations where 
clouds formed were probably related to local surface conditions and 
methane abundances. An extremely dry location on Titan might not 
form clouds even if wave activity favoured it—whereas a methane 
lake might need only a small trigger to generate cloud activity above it 
(note that all of the south polar clouds that formed during this event 
were centred near 72° S, the latitude of Ontario Lacus). 

Subsequent frequent observations from both Gemini and IRTF (30 
distinct nights) revealed little to no cloud activity from 15 May to the 
end of the observable Titan season from the ground (7 July), indi- 
cating that the total duration of the increased cloud activity was less 
than one month. The duration of this period of increased cloud 
activity matched that of the last such outburst, which occurred in 


25 April 2008 28 April 2008 


latitude lines in 30° increments. The northwesternmost latitudes 
encompassed by the initial pulse of cloud activity on 14 April show the 
continued presence of a small cloud for at least 19 days after the initial event, 
indicating a potential tie to the surface. 
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October 2004 and lasted for approximately 30 days. Large cloud 
outbursts such as these are thought to be associated with significant 
amounts of precipitation” and probably play a major part in shaping 
the geological features on the surface of Titan. 

One of the most intriguing questions is the cause and precise loca- 
tion of the initial pulse of cloud activity. Observations from 28 April 
and 1 May show that the northwesternmost regions encompassed by 
the initial pulse of cloud activity (15° S, 250° W) still remain bright in 
the 2.1-j1m images, indicating the continued presence ofa small cloud 
(Figs 2 and 3). This region could have been the location of the initial 
pulse of cloud activity given the subsequent southeastward velocity of 
the bright cloud features. The Cassini ISS surface map in this region is 
unremarkable. Although several locations on Titan have been sug- 
gested to show evidence for cryovolcanic activity'*” (specifically 
regions that are bright at 5 um), this region is not one of them. The 
difficulty of forming large convective clouds at tropical latitudes, 
combined with the continued brightness of this region for at least 
19 days, may point to a potential connection to the surface. If the 
surface were locally heated from below only slightly, parcels of air 
could be more easily lifted up to the level of free convection, leading 
to deep convective clouds. Additionally or alternatively, an injection 
of methane from the subsurface at this location could have a similar 
effect by lowering the level of free convection. The brightness and 
rapid formation of the initial cloud system indicates that the region 
centred on 15° S, 250° W is a good candidate location for observing 
surface changes due to methane rainout and, if the initial pulse were 
indeed tied to the surface, perhaps also recent cryovolcanism. 
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Dense packings of the Platonic and Archimedean 


solids 


S. Torquato’”""" & Y. Jiao* 


Dense particle packings have served as useful models of the struc- 
tures of liquid, glassy and crystalline states of matter’, granular 
media**, heterogeneous materials’ and biological systems**. 
Probing the symmetries and other mathematical properties of 
the densest packings is a problem of interest in discrete geometry 
and number theory®"'. Previous work has focused mainly on 
spherical particles—very little is known about dense polyhedral 
packings. Here we formulate the generation of dense packings 
of polyhedra as an optimization problem, using an adaptive 
fundamental cell subject to periodic boundary conditions (we term 
this the ‘adaptive shrinking cell’ scheme). Using a variety of multi- 
particle initial configurations, we find the densest known packings 
of the four non-tiling Platonic solids (the tetrahedron, octahedron, 
dodecahedron and icosahedron) in three-dimensional Euclidean 
space. The densities are 0.782..., 0.947..., 0.904... and 0.836..., 
respectively. Unlike the densest tetrahedral packing, which must 
not be a Bravais lattice packing, the densest packings of the other 
non-tiling Platonic solids that we obtain are their previously 
known optimal (Bravais) lattice packings. Combining our simu- 
lation results with derived rigorous upper bounds and theoretical 
arguments leads us to the conjecture that the densest packings of 
the Platonic and Archimedean solids with central symmetry are 
given by their corresponding densest lattice packings. This is the 
analogue of Kepler’s sphere conjecture for these solids. 

A large collection of non-overlapping solid objects (particles) in 
d-dimensional Euclidean space R¢ is called a packing. The packing 
density @ is defined as the fraction of space R@ covered by the particles. 
A problem that has been a source of fascination to mathematicians 
and scientists for centuries is the determination of the densest arrange- 
ment(s) of particles that do not tile space and the associated maximal 
density max (ref. 9). Most previous work has focused on spherical 
particles, but even for this simple shape the problem is notoriously 
difficult. Indeed, Kepler’s conjecture concerning the densest sphere 
packing arrangement was only proved by Hales in 2005 (ref. 10). 

Attention has very recently turned to finding the maximum- 
density packings of nonspherical particles in R’, including ellip- 
soids’’, tetrahedra'*™, and superballs'*. Very little is known about 
the densest packings of polyhedral particles that do not tile space, 
including the majority of the Platonic and Archimedean solids 
studied by the ancient Greeks. The difficulty in obtaining dense pack- 
ings of polyhedra is related to their complex rotational degrees of 
freedom and to the non-smooth nature of their shapes. 

The Platonic solids (mentioned in Plato’s Timaeus) are convex 
polyhedra with faces composed of congruent convex regular poly- 
gons. There are exactly five such solids: the tetrahedron, icosahedron, 
dodecahedron, octahedron and cube (see Fig. 1). An Archimedean 
solid is a highly symmetric, semi-regular convex polyhedron com- 
posed of two or more types of regular polygons meeting in identical 


vertices. There are thirteen Archimedean solids (see Fig. 1). We note 
that the tetrahedron (P1) and truncated tetrahedron (A1) are the only 
Platonic and Archimedean solids, respectively, that are not centrally 
symmetric. A particle is centrally symmetric if it has a centre C that 
bisects every chord through C connecting any two boundary points of 
the particle. We will see that this type of symmetry is fundamental in 
determining the nature of the dense packing arrangements. 

Some definitions are in order here. A lattice A in R? is an infinite 
set of points generated by a set of discrete translation operations 
(defined by integer linear combinations of a basis of IR?) (ref. 4). A 
(Bravais) lattice packing is one in which the centroids of the non- 
overlapping particles are located at the points of A, each oriented in 
the same direction. The space R® can then be geometrically divided 
into identical regions F called fundamental cells, each of which con- 
tains just the centroid of one particle. Thus, the density of a lattice 
packing is given by 


Vparticle 

p= te (1) 
where particle is the volume of a particle and Vy is the volume of a 
fundamental cell. A periodic packing of particles is obtained by 
placing a fixed non-overlapping configuration of N particles (where 
N= 1) with arbitrary orientations in each fundamental cell ofa lattice 
A. Thus, the packing is still periodic under translations by A, but the N 
particles can occur anywhere in the chosen cell subject to the non- 
overlap condition. The density of a periodic packing is given by 


N Vparticle 
=e 2 
p= Shes (2) 


We formulate the problem of generating dense packings of non- 
overlapping polyhedra within an adaptive fundamental cell subject to 
periodic boundary conditions as an optimization problem (see 
Methods Summary). We call this optimization scheme the ‘adaptive 
shrinking cell’ (ASC). Figure 2 illustrates a simple sequence of con- 
figuration changes for a four-particle packing. 

Finding the densest packings of regular tetrahedra is part of the 
eighteenth problem in Hilbert’s famous set of problems. The densest 
(Bravais) lattice packing of tetrahedra (which requires all of the 
tetrahedra to have the same orientations) has the relatively low 
density @'"'** — 18/49 = 0.367... and each tetrahedron touches 14 
others'®. Recently, Conway and Torquato showed that the densest 
packings of tetrahedra must not be Bravais lattice packings, and 
found packings with density as large as @~0.72 (ref. 13). 
P. Chaikin, S. Wang and A. Jaoshvili experimentally generated 
jammed disordered packings of nearly tetrahedral dice with 
¢@ ~0.75 (unpublished work). Chen™ has recently discovered a 
periodic packing of tetrahedra with ¢ = 0.7786..., which we call 
the ‘wagon-wheels’ packing because the basic subunits consist of 
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Figure 1| The five Platonic solids and the 13 Archimedean solids. The five 
Platonic solids are the tetrahedron (P1), icosahedron (P2), dodecahedron 
(P3), octahedron (P4) and cube (P5). The 13 Archimedean solids are the 
truncated tetrahedron (A1), truncated icosahedron (A2), snub cube (A3), 
snub dodecahedron (A4), rhombicosidodecahedron (A5), truncated 


two orthogonally intersecting ‘wagon wheels’. A wagon wheel 
consists of five contacting tetrahedra packed around a common edge 
(see figure la of ref. 13). 

We begin by solving the ASC scheme to obtain dense packings of 
tetrahedra using initial configurations based upon low-density 
versions of the aforementioned packings. Initial conditions based 
on periodic copies of the wagon-wheels packing with 72 particles 
per cell lead to the densest packing of tetrahedra reported to date 
with @ = 0.782021... (see Fig. 3). Its lattice vectors and other char- 
acteristics are given in the Supplementary Information. The pref- 
erence for face-to-face (not vertex-to-face) contacts and the lack of 
central symmetry ensure that dense tetrahedral packings must be 
non-lattice structures. 

To obtain dense packings of icosahedra, dodecahedra and octahedra, 
we use a wide range of initial configurations. These include multi- 
particle configurations (with N ranging from 20 to 343) of random 
‘dilute’ packings and a variety of lattice packings with a wide range of 
densities. For icosahedra, dodecahedra and octahedra, we obtain final 
packings with densities at least as large as 0.836315..., 0.904002... and 
0.947003..., respectively, which are extremely close in structure and 
density to their corresponding optimal lattice packings with 


a b 


Figure 2 | Sequential changes of a four-particle packing configuration 
according to the design variables in the ASC algorithm. By efficiently 
exploring the design-variable space, which consists of the particle 
configurational space and the space of lattices (owing to our use of an 
adaptive fundamental cell), the ASC scheme enables us to find a point in the 
design-variable space in the neighbourhood of the starting point that has a 
higher packing density than the initial density. The process is continued 
until the deepest minimum of the objective function (a maximum of packing 


A8& AQ A10 Alt Ai2 Ai3 


icosidodecahedron (A6), truncated cuboctahedron (A7), icosidodecahedron 
(A8), rhombicuboctahedron (A9), truncated dodecahedron (A10), 
cuboctahedron (A11), truncated cube (A12), and truncated octahedron 
(A13). The cube (P5) and truncated octahedron (A13) are the only Platonic 
and Archimedean solids, respectively, that tile space. 


pat = 0.836357... (ref. 17), pati? = (5+V/5) /8=0.904508 ... 
(ref. 17) and head =18/19=0.947368... (ref. 18), respectively. 
Figure 3 shows the optimal lattice packings of icosahedra, dodecahedra 
and octahedra, in which each particle contacts 12, 12 and 14 others, 
respectively. Our simulation results strongly suggest that the optimal 
lattice packings of the centrally symmetric Platonic solids are indeed the 
densest packings of these particles, especially since these arise from a 
variety of initial dilute multi-particle configurations within an adaptive 
fundamental cell. 

We can show that the maximal density ¢max of a packing of con- 
gruent nonspherical particles of volume Vparticle is bounded from 
above according to: 


upper bound: Vparticle 7% 

a < Goa =min Fo VI18 2 (3) 
where Vsphere is the volume of the largest sphere that can be inscribed in 
the nonspherical particle and / \/18 is the maximal sphere-packing 
density. The proof is given in the Supplementary Information. The 
upper bound (3) will be relatively tight for packings of nonspherical 
particles provided that the asphericity y (equal to the ratio of the 
circumradius to the inradius, see online-only Methods) of the particle 


c 


density) is obtained, which could be either a local or global optimum. Here 
we show such a sequence. a, An initial configuration of four particles. b, A 
trial move of a randomly selected particle (red) that is rejected because it 
overlaps another particle. This is determined precisely using the separation 
axis theorem”*. ¢, A trial move that is accepted, which results in a 
deformation and compression (small in magnitude) changing the 
fundamental cell shape and size as well as the relative distances between the 
particles. 
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Figure 3 | Portions of the densest packing of tetrahedra obtained from our 
simulations, and the optimal lattice packings of the icosahedra, 
dodecahedra and octahedra to which our simulations converge. All of these 
packings are at least locally jammed, that is, each particle cannot be 
translated or rotated while fixing the positions and orientations of all the 
other particles””**. We emphasize that even though the latter three cases 
begin with complex multi-particle initial configurations in the large 
fundamental (repeating) cell, they all converge to packings in which a 


is not large. Bound (3) cannot generally be sharp (that is, exact) for a 
nontiling, nonspherical particle, so any packing whose density is close 
to the upper bound (3) is nearly optimal, if not optimal. 

Figure 4 compares the density of the densest lattice packings of the 
Platonic and Archimedean solids to the corresponding upper bounds 
on the maximal density for such packings. The central symmetry of 
the majority of the Platonic and Archimedean solids and their asso- 
ciated relatively small asphericities explain the corresponding small 
differences between prattice and @ypPer bound and is consistent with our 
simulation findings that strongly indicate that their optimal arrange- 
ments are their respective densest lattice packings. 

Why should the densest packings of the centrally symmetric solids 
be their corresponding optimal lattice packings? First, face-to-face 
contacts allow such polyhedral packings to achieve higher densities 
because they enable the contacting centroids around each particle to 
come closer together. Second, face-to-face contacts are maximized 
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Figure 4 | Comparison of the densest known lattice packings (blue circles) 
of the Platonic and Archimedean solids'*'* to the corresponding upper 
bounds (red squares) obtained from bound (3). The large asphericity and 
lack of central symmetry of the tetrahedron (P1) and truncated tetrahedron 
(A1) are consistent with the large gaps between their upper-bound densities 
and densest-lattice-packing densities, and the fact that there are non-lattice 
packings with densities appreciably greater than #'*"'“* (green triangles). The 
truncated tetrahedron is the only non-centrally symmetric Archimedean 
solid, the densest known packing of which is a non-lattice packing with two 
particles per fundamental cell and a density at least as high as 23/ 

24 = 0.958333... (ref. 13). 
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smaller repeat unit contains only one centroid, that is, they all converge to 
Bravais lattice packings and, in fact, the corresponding densest lattice 
packings. a, Tetrahedral packing. We depict the 72 particles in the 
fundamental cell of this non-lattice packing. Within the cell, the particles are 
characterized by short-range translational order and a preference for face-to- 
face contacts (see Supplementary Information). b, Optimal lattice packing of 
icosahedra. ¢, Optimal lattice packing of dodecahedra. d, Optimal lattice 
packing of octahedra. 


when each particle has the same orientation because of the central 
symmetry and the equivalence of the three principal axes (associated 
with the small asphericity) of the solid. This is consistent with a lattice 
packing, the densest of which is the optimal one. These arguments in 
conjunction with our simulation results for the Platonic solids and 
rigorous bounds lead us to the following conjecture: The densest 
packings of the centrally symmetric Platonic and Archimedean solids 
are given by their corresponding optimal lattice packings. This is the 
analogue of Kepler’s sphere conjecture for these solids. 

There is no reason to believe that denser packings of tetrahedra 
cannot be achieved by using even better initial conditions than those 
based on the wagon-wheels packing and a larger number of particles. 
We observe that the densest packings of all of the Platonic and 
Archimedean solids reported here as well as the densest known pack- 
ings of superballs’’ and ellipsoids’? have densities that exceed the 
optimal sphere packing density #P8e" =x / V18=0.7408 .... These 
results are consistent with Ulam’s conjecture’, which may be 
violated if the convex particle has little or no symmetry, but a 
counterexample has yet to be given. 

How does our conjecture extend to other polyhedral packings? It is 
natural to group the infinite families of prisms and antiprisms” with 
the Archimedean solids. A prism is a polyhedron having bases that 
are parallel, congruent polygons and sides that are parallelograms. An 
antiprism is a polyhedron having bases that are parallel, congruent 
polygons and sides that are alternating bands of triangles. Prisms with 
an even number of sides and antiprisms are centrally symmetric and 
so it may be that Bravais lattices of such solids are optimal. However, 
prisms with an odd number of sides are not centrally symmetric and 
thus their optimal packings may not be Bravais lattices. In future 
work, we will determine whether our conjecture extends to prisms 
and antiprisms. 


METHODS SUMMARY 


The objective function in our ASC optimization scheme is taken to be the 
negative of the packing density #. Starting from an initial packing configuration 
in the fundamental cell, the positions and orientations of the polyhedra are 
design variables for the optimization. Importantly, we also allow the boundary 
of the fundamental cell to deform as well as shrink or expand such that there is a 
net shrinkage (increase of the density) in the final state. Thus, the deformation 
and compression/expansion of the cell boundary are also design variables. We 
are not aware of any packing algorithm that employs both a sequential search of 
the configurational space of the particles and the space of lattices via an adaptive 
fundamental cell that shrinks on average to obtain dense packings. The ASC has a 
number of novel features that distinguish it from previous packing algorithms 
that have been devised for spheres*'’, ellipsoids**”° and superballs’” (see online- 
only Methods for details). 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 29 April; accepted 24 June 2009. 


1. Bernal, J. D. in Liquids: Structure, Properties, Solid Interactions (eds Hughel, T. J.) 
25-50 (Elsevier, 1965). 
2. Zallen, R. The Physics of Amorphous Solids (Wiley, 1983). 
3. Torquato, S. Random Heterogeneous Materials: Microstructure and Macroscopic 
Properties (Springer, 2002). 
4. Chaikin, P.M. & Lubensky, T. C. Principles of Condensed Matter Physics (Cambridge 
Univ. Press, 2000). 
5. Edwards, S. F. in Granular Matter (eds Mehta, A.) 121-140 (Springer, 1994). 
6. Liang, J. & Dill, K. A. Are proteins well-packed? Biophys. J. 81, 751-766 (2001). 
7. Purohit, P. K., Kondev, J. & Phillips, R. Mechanics of DNA packaging in viruses. 
Proc. Natl Acad. Sci. USA 100, 3173-3178 (2003). 
8. Gevertz, J. L. & Torquato, S. A novel three-phase model of brain tissue 
microstructure. PLOS Comput. Biol. 4, e1000152 (2008). 
9. Conway, J. H. & Sloane, N. J. A. Sphere Packings, Lattices and Groups (Springer, 
1998). 
0. Hales, T. C. A proof of the Kepler conjecture. Ann. Math. 162, 1065-1185 (2005). 
1. Cohn, H. & Elkies, N. New upper bounds on sphere packings. |. Ann. Math. 157, 
689-714 (2003). 
2. Donev, A., Stillinger, F. H., Chaikin, P. M. & Torquato, S. Unusually dense crystal 
ellipsoid packings. Phys. Rev. Lett. 92, 255506 (2004). 
3. Conway, J. H. & Torquato, S. Packing, tiling and covering with tetrahedra. Proc. 
Natl Acad. Sci. USA 103, 10612-10617 (2006). 
4. Chen, E. R. A dense packing of regular tetrahedra. Discrete Comput. Geom. 40, 
214-240 (2008). 
5. Jiao, Y., Stillinger, F. H. & Torquato, S. Optimal packings of superballs. Phys. Rev. E 
79, 041309 (2009). 
6. Hoylman, D. J. The densest lattice packing of tetrahedra. Bull. Am. Math. Soc. 76, 
135-137 (1970). 
7. Betke, U. & Henk, M. Densest lattice packings of 3-polytopes. Comput. Geom. 16, 
157-186 (2000). 
8. Minkowski, H. Dichteste gitterformige Lagerung kongruenter Kérper. Nachr. Akad. 
Wiss. Gottingen Math. Phys. KI. 11 311-355 (1904). 
9. Gardner, M. The Colossal Book of Mathematics: Classic Puzzles, Paradoxes, and 
Problems Ch. 10, 135 (Norton, 2001). 


LETTERS 


20. Cromwell, P. R. Polyhedra (Cambridge Univ. Press, 1997). 

21. Jodrey, W.S. & Tory, E.M. Computer simulation of close random packing of equal 
spheres. Phys. Lett. A 32, 2347-2351 (1985). 

22. Rintoul, M. D. & Torquato, S. S. Hard-sphere statistics along the metastable 
amorphous branch. Phys. Rev. E 58, 532-537 (1998). 

23. Uche, O. U., Stillinger, F. H. & Torquato, S. Concerning maximal packing 
arrangements of binary disk mixtures. Physica A 342, 428-446 (2004). 

24. Donev, A., Torquato, S. & Stillinger, F. H. Neighbor list collision-driven molecular 
dynamics for nonspherical hard particles. |. Algorithmic details. J. Comput. Phys. 
202, 737-764 (2005). 

25. Donev, A., Torquato, S. & Stillinger, F. H. Neighbor list collision-driven molecular 
dynamics for nonspherical hard particles. Il. Applications to ellipses and 
ellipsoids. J. Comput. Phys. 202, 765-793 (2005). 

26. Golshtein, E. G. & Tretyakov, N. V. Modified Lagrangians and Monotone Maps in 
Optimization (Wiley, 1996). 

27. Torquato, S. & Stillinger, F. H. Multiplicity of generation, selection, and 
classification procedures for jammed hard-particle packings. J. Phys. Chem. B 105, 
11849-11853 (2001). 

28. Donev, A., Connelly, R., Stillinger, F. H. & Torquato, S. Underconstrained jammed 
packings of nonspherical hard particles: ellipses and ellipsoids. Phys. Rev. E 75, 
051304 (2007). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We are grateful to H. Cohn and J. Conway for comments on 
our manuscript. S.T. thanks the Institute for Advanced Study for its hospitality 
during his stay there. This work was supported by the National Science Foundation 
under award numbers DMS-0804431 and DMR-0820341. The figures showing 
the polyhedra were generated using the AntiPrism package developed by 

A. Rossiter. 


Author Contributions S.T. devised the algorithm and upper bounds, performed 
theoretical analysis, and wrote the paper. Y.J. implemented the algorithm, 
performed theoretical analysis, commented on the manuscript and created all of 
the figures. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. Correspondence and requests for materials should be 
addressed to S.T. (torquato@princeton.edu). 


879 


©2009 Macmillan Publishers Limited. All rights reserved 


doi:10.1038/nature08239 


METHODS 


The ASC optimization problem could be solved using various techniques, 
depending on the shapes of the particles. For example, for spheres, we have shown 
that linear programming techniques can efficiently produce optimal solutions 
(unpublished work). However, for polyhedra, the complex non-overlap condi- 
tions make the ASC scheme inefficient to solve using linear programming 
methods. For polyhedral particles, we solve the ASC optimization problem using 
a standard Monte Carlo procedure with a Metropolis acceptance rule for trial 
moves to search the design-variable space efficiently, which contains both the 
configuration space of the particles and the space of lattices. 

In our implementation, a polyhedral particle is represented by the position of its 
centroids as well as the coordinates of all its vertices relative to the centroid. 
Although this representation contains redundant information, it is a convenient 
way to deal with the rotational motions of the polyhedra. To search the configura- 
tion space of the particles, small random trial moves of arbitrarily selected particles 
are attempted sequentially for each particle. Each trial move is equally likely to bea 
translation of the centroid of the particle or a rotation of the particle about a 
randomly oriented axis through its centroid. 

The space of lattices is searched by deforming/compressing/expanding the 
fundamental cell, which is completely characterized by a strain tensor in the 
linear regime (that is, small strain limit). The trace of the strain tensor determines 
the volume change of the fundamental cell and is involved in the objective 
function. The off-diagonal components of the tensor determine the shape 
change of the cell. The positions of the particles centroids are relative coordinates 
with respect to the lattice vectors. When the strain tensor is applied to the lattice 
vectors, although the relative coordinates of the centroids remain the same, the 
Euclidean distances between the particles will change. Thus, the deformation/ 
compression/expansion of the fundamental cell at least in part allows for col- 
lective particle motions, which is more efficient for finding a direction in the 
design-variable space leading to a higher packing density. Moreover, it is the 
overall compression of the fundamental cell that causes the packing density to 
increase, not the growth of the particles, as in most molecular dynamics and 
Monte Carlo hard-particle packing algorithms'*’'. It should be noted that for 
polyhedral particles, an algorithm that employs particle growth with an adaptive 
non-shrinking fundamental cell is computationally less efficient than the ASC 
scheme, which fixes the particle size while allowing the cell to shrink on average. 
Whereas a ‘growth’ algorithm requires manipulating the coordinates of the 
vertices of all of the particles, our ‘shrinkage’ method (in which the particle sizes 
are kept fixed) requires only the computation of the six strain components. 


nature 


In the simulation, starting from an initial configuration of polyhedral particles, a 
trial configuration can be generated by moving (translating and rotating) a 
randomly chosen particle or by a random deformation and compression/expansion 
of the fundamental cell. If this causes interparticle overlaps, the trial configuration is 
rejected; otherwise, if the fundamental cell shrinks in size (which makes the density # 
higher), the trial configuration is accepted. On the other hand, if the cell expands in 
size, the trial configuration is accepted with a specified probability p,.., which is 
made to decrease as ¢# increases and approaches zero at the jamming limit” (that is, 
locally maximally dense packing) is reached. In particular, we find p,.., with an initial 
value Pace ~ 0.35 that decreases as a power law with an exponent equal to — 1, works 
well for most systems that we studied. The ratio of the number of particle motions to 
the number of cell trial moves should be greater than unity (especially towards the 
end of the simulation), because compressing a dense packing could cause many 
overlaps between the particles. Depending on the initial configuration, the magni- 
tudes of the particle motions and the strain components need to be chosen carefully 
to avoid the system getting stuck in some shallow local minimum. 

A crucial aspect of any packing algorithm is the need to check for interparticle 
overlaps under attempted particle motions. Hard polyhedron particles, unlike 
spheres, ellipsoids and superballs, do not possess simple ‘overlapping’ functions. 
(The overlap function of a pair of strictly convex and smooth particles is a 
function of the positions, orientations and shapes of the two particles, whose 
value indicates whether the two particles overlap or not, or whether they are 
tangent to one another.) The separation axis theorem” enables us to check for 
interparticle overlaps for polyhedra up to the numerical precision of the 
machine. In particular, the theorem states that two convex polyhedra are 
separated in space if and only if there exists an axis, on which the projections 
of the vertices of the two polyhedra do not overlap. The separation axis is either 
perpendicular to one of the faces of the polyhedra or perpendicular to a pair of 
edges, one from each polyhedron. Thus, this reduces the number of axes that 
need to be checked from infinity to [E(E— 1)/2 + 2F], where E and F are the 
number of edges and number of faces of the polyhedra, respectively. A pre-check 
using the circumradius and inradius of the polyhedra dramatically speeds up the 
simulations, that is, two particles are guaranteed to overlap if the centroidal 
separation is smaller than twice the inradius and guaranteed not to overlap if 
the centroidal separation is larger than twice the circumradius. The circum- 
sphere is the smallest sphere containing the particle. The insphere is the largest 
sphere than can be inscribed in the particle. 

The cell method and near-neighbour list**’? are also used to improve the 
efficiency of the simulation, but are appropriately modified to incorporate the 
adaptive fundamental cell. 
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Atlantic hurricanes and climate over the past 


1,500 years 


Michael E. Mann’, Jonathan D. Woodruff’, Jeffrey P. Donnelly® & Zhihua Zhang’ 


Atlantic tropical cyclone activity, as measured by annual storm 
counts, reached anomalous levels over the past decade’. The short 
nature of the historical record and potential issues with its 
reliability in earlier decades, however, has prompted an ongoing 
debate regarding the reality and significance of the recent rise’». 
Here we place recent activity in a longer-term context by comparing 
two independent estimates of tropical cyclone activity over the past 
1,500 years. The first estimate is based on a composite of regional 
sedimentary evidence of landfalling hurricanes, while the second 
estimate uses a previously published statistical model of Atlantic 
tropical cyclone activity driven by proxy reconstructions of past 
climate changes. Both approaches yield consistent evidence of a 
peak in Atlantic tropical cyclone activity during medieval times 
(around Ap 1000) followed by a subsequent lull in activity. The 
statistical model indicates that the medieval peak, which rivals or 
even exceeds (within uncertainties) recent levels of activity, results 
from the reinforcing effects of La-Nina-like climate conditions and 
relative tropical Atlantic warmth. 

A number of past studies have attempted to place modern Atlantic 
tropical cyclone activity in a longer-term context using regional proxy 
evidence of past landfalling Atlantic hurricanes (tropical cyclones with 
maximum sustained surface winds exceeding 74 miles per hour)**. 
Some studies* have sought to infer past changes in activity from 
plausible local conditioning factors such as wind strength and sea 
surface temperature (SST), though the interpretations of these studies 
have been contested®. Qualitative comparisons between palaeo- 
hurricane reconstructions appear to show some temporal coherence*’. 
However, no past studies have attempted to synthesize multiple 
records from distinct regions into a basin-integrated reconstruction 
of Atlantic hurricane activity. Moreover, no past studies have sought 
to quantitatively relate estimated variations in hurricane or tropical 


cyclone activity to reconstructions of the key large-scale climate factors 
known to have a significant influence on modern Atlantic tropical 
cyclone activity. Here we produce an empirical record of past 
landfalling Atlantic hurricane activity by combining information 
from multiple sedimentary records of hurricane-induced overwash. 
Further, we compare these resulting estimates to independent statis- 
tical model predictions of past tropical cyclone activity driven by 
proxy-based large-scale climate reconstructions. 

Sediment-based overwash reconstructions of hurricane landfall are 
limited in number, but span a wide geographic area across the North 
Atlantic basin affected by hurricanes. Our compilation includes (see 
Fig. 1) a site from the Caribbean (Vieques, Puerto Rico®'®), one from 
the US Gulf Coast’, one from the southeastern US coast'!, three from 
the mid-Atlantic coast (one from New York® and two from New 
Jersey'’*'*) and two from southeastern New England (one from 
Rhode Island’* and another from Massachusetts'*), yielding five 
distinct regional series. We obtained a probabilistic estimate of past 
basin-wide landfalling hurricane activity using an appropriately 
weighted combination of the information from these five regional 
series, and incorporating radiocarbon age model uncertainties. 

An independent estimate of past tropical cyclone activity was 
obtained using a statistical model for Atlantic tropical cyclone 
counts. This previously developed and validated*"* statistical model 
conditions annual Atlantic tropical cyclone counts on three key 
large-scale climate state variables tied to historical variations in 
Atlantic tropical cyclone counts: (1) the SST over the main develop- 
ment region (MDR) for tropical Atlantic tropical cyclones, which 
reflects the favourability of the local thermodynamic environment; 
(2) the El Nifio/Southern Oscillation (ENSO), which influences the 
amount of (unfavourable) vertical wind shear; and (3) the North 
Atlantic Oscillation (NAO), which affects the tracking of storms, 
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Figure 1| Overwash sediment records of landfalling hurricanes. Event 
histories are shown for New England (blue), Mid-Atlantic (red), the 
southeastern US coast (turquoise; grey denotes oyster-bed events not used 
for reasons discussed by ref. 28 and in the Supplementary Information), the 
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Gulf Coast (yellow) and the Caribbean (green). The horizontal width of 
shaded rectangles indicates the +1¢ age model uncertainties. Circles 
indicate historical hurricane events. 
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determining how favourable an environment they encounter. The 
statistical model was driven by proxy-based reconstructions!”"* of 
these three state variables (Fig. 2), yielding a predicted history of 
Atlantic tropical cyclone counts for past centuries. 

We compared the sediment-based record against the above stati- 
stical estimate of basin-wide tropical cyclone activity (Fig. 3), guided 
by a working assumption that an appropriately weighted composite of 
regional landfalling hurricane activity varies, at multidecadal and 
longer timescales, in rough proportion to basin-wide tropical cyclone 
activity. Although the validity of this assumption can (as discussed 
further below) be questioned, it is worth noting that the sediment- 
based record tracks the observed long-term changes in tropical cyclone 
count over the historical period remarkably well (inset, Fig. 3). On the 
basis of previously published results, the required storm strength 
required for overwash and deposition varies among sites'’, although 
qualitatively similar results were obtained assuming uniform sensi- 
tivity (category 3 or greater storms) among sites (Supplementary 
Information). We down-weighted the Vieques data after AD 1700 to 
account for an estimated®”° artificial inflation of overwash deposit 
occurrences at the site due to increased sedimentation rates in recent 
centuries. 

Our two entirely independent estimates of past tropical cyclone 
activity were found to be statistically consistent (that is, they overlap 
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within their estimated 95% confident intervals), with certain excep- 
tions, which are discussed below. Jointly, the two independent records 
suggest periods of high activity (that is, comparable to current levels) 
during a medieval era of roughly AD 900-1100. Both estimates also 
suggest a general decrease in the level of activity after about AD 1200°’. 

Of particular interest is the medieval peak in activity, which 
matches or even exceeds current levels of activity within uncertainties 
for the statistical model. The peak arises in the statistical model from 
a combination of (see Fig. 2) La-Nifa-like conditions during the 
medieval era which have been discussed elsewhere” and relatively 
warm SSTs in the tropical North Atlantic at that time, with both 
of these factors having a substantial role in the statistical model pre- 
dictions (Supplementary Information). In contrast, this interval is 
followed by a combination of relatively cold Atlantic SSTs and more 
El-Nino-like conditions in the tropical Pacific, leading to a relative 
lull in modelled tropical cyclone activity in subsequent 
centuries before the modern increase. This finding is in contrast to 
some other recent work’, because we do not find hurricane activity 
during the 1970s to be anomalously low in comparison with that over 
the past few centuries. 

There are also some noteworthy discrepancies between the two 
independent estimates of past Atlantic tropical cyclone activity 
provided in this study. There is some independent historical 
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Figure 2 | Proxy reconstructions used in statistical model estimates of 
Atlantic tropical cyclone counts. a, The MDR SST”; b, the Ninio3 SST"; and 
c, the boreal winter NAO". Positive indices are associated with enhanced 
(a) and diminished (b, ¢) tropical cyclone activity, respectively’®. The 
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corresponding instrumental series'”’* are shown for comparison. All series 


are decadally smoothed”, and 95% uncertainty intervals are indicated by 
yellow shading. 
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documentary evidence for increased tropical cyclone activity in the 
Caribbean during the 1760s—1780s”, and a modest peak at this time 
is evident in the statistical model estimate. The sediment-based 
estimate, however, displays a peak that is later (early nineteenth 
century), and of considerably greater magnitude. The medieval peak 
in the sediment-based record falls slightly later than in the statistical 
model estimate, and is of greater duration. The peak in activity indi- 
cated by the sediment record in the mid-fifteenth century is not seen 
in the statistical model results. 

There are a number of plausible explanations for the differences 
between the two records. Landfalling hurricanes do not vary in fixed 
proportion to the total number of storms generated on decadal time- 
scales'. A form of the well known ‘ergodic hypothesis’ holds that on 
increasingly longer averaging timescales (for example, the centennial 
timescale variations of interest in this study), variations in totals 
among the considerably sparser group of landfalling hurricanes will 
more closely mirror those in the larger group of basin-wide tropical 
cyclones. However, certain predictors, such as the NAO, influence 
not only the basin-wide activity but also the prevailing regions of 
landfall, and given an incomplete coastal observing network, a 
change in the latter could potentially masquerade as a change in 
the former. The sites used to produce a basin-wide sediment 
composite record may simply not be representative enough of the 
true, full basin-wide activity. This caveat applies in particular to the 
Caribbean, Gulf Coast and southeast US coast, because we are 
relying on just one record in each case to estimate past tropical 
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cyclone activity in these regions. A jackknife estimate of uncertainty 
based on the removal of any one of the five contributing regions 
(Fig. 3) nonetheless suggests that the main features of our basin-wide 
composite are reasonably robust (the individual jackknife surrogates 
are shown in Supplementary Information). 

The sediment record could be contaminated by influences unrelated 
to hurricane strikes such as alterations to barrier morphology**”’. Also, 
potential biases in the proxy-based palaeoclimate reconstructions!” 
used to drive the statistical model would of course lead to biases in the 
statistical model predictions themselves. Finally, there is the possibility 
that other potential tropical cyclone influences (for example, the West 
African monsoon’®) not accounted for in the statistical model (that is, 
that are not correlated with the three predictors used), may have had a 
more important role in the past than is evident during the modern 
interval. 

Such uncertainties and caveats notwithstanding, the striking 
consistency of certain key features such as the medieval peak in 
Atlantic tropical cyclone activity and subsequent lull using two 
entirely independent approaches to estimating past activity suggest 
that these features are real, and provides some degree of additional 
validation of our current understanding of the primary factors 
governing long-term changes in Atlantic tropical cyclone activity. 
Paths forward that may further improve our understanding include, 
among other things, development of a more extensive and diverse 
set of multi-proxy estimates of past landfalling hurricane activity, 
and improved reconstructions of the past histories of key large-scale 
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Figure 3 | Long-term Atlantic tropical cyclone counts. Modern Atlantic 
tropical cyclone counts (red) compared both with statistical model estimates 
of tropical cyclone activity based on modern instrumental (aD 1851-2006; 
black) and proxy-reconstructed (AD 500-1850; blue) climate indices and an 
estimate of basin-wide landfalling Atlantic hurricane activity (AD 500-1991) 
derived from regional composites of overwash sediments (green). All series 
were smoothed” at multidecadal (>40-year) timescales. The sediment 
composite record was standardized to have the same mean and multidecadal 
variance as the statistical model estimates. Uncertainties for the statistical 
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model estimates (grey shading, indicating 95% confidence intervals) take 
into account the uncertainty in the statistical model itself (grey shading), 
and—in the case of the proxy-reconstructed indices (grey shading), the 
additional uncertainty due to the uncertainties in the proxy-reconstructed 
climate indices. Uncertainties for the sediment composite record (thin 
dashed black curves indicating upper and lower limits of the 95% confidence 
interval) are derived from jackknifing of the full composite with respect to 
each of the five contributing regional estimates, as discussed in the text. 
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climate phenomena influencing Atlantic tropical cyclones such as 
ENSO. 


METHODS SUMMARY 

Sediment hurricane landfall records. Regional sediment series were weighted 
with respect to inverse modern return periods for landfalling tropical cyclones'” 
and summed to yield basin-wide composites of tropical cyclone activity. We used 
a Monte Carlo approach to generate an ensemble of such composites consistent 
with the event chronologies and age model uncertainties. A basin-wide landfalling 
hurricane activity series was defined by the maximum rate of activity for each year 
over this ensemble, that is, the maximum rate of activity for each year that is 
consistent with the event chronologies within uncertainties. We examined sensi- 
tivity both to the contributions of individual regions (issues of reliability have 
been raised with some events in the Gulf Coast” and southeastern US coast” 
records), and to the assumed threshold of the sites to overwash from varying 
strengths of hurricanes (Supplementary Information). 

Statistical model. We used a statistical model of tropical cyclone counts as 
conditioned on*'®: the MDR SST, the ENSO (measured by the boreal winter 
Nifio3 SST index), and the boreal winter NAO index. The tropical cyclone count 
series was first corrected for a modest estimated undercount’ before the mid- 
twentieth-century undercount, although similar results were obtained 
(Supplementary Information) using the largest*”? published estimates of under- 
count bias. The statistical model, which is trained on the modern historical 
record, has been shown in independent statistical validation experiments*’® to 
resolve roughly 50% of the interannual and longer-term variations in Atlantic 
tropical cyclone counts. The model, in this study, was driven by decadally 
smoothed proxy reconstructions of the three required climate indices to yield 
predictions of tropical cyclone activity over past centuries. The MDR SST and 
Nino3 reconstructions were derived from proxy-based surface temperature 
patterns spanning the past 1,500 years'’. Though an NAO reconstruction was 
available only for the past 500 years'*, the NAO influence was found to be very 
minor (Supplementary Information). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Formation of regional sediment series and composites. Regional landfalling 
hurricane chronologies were formed by combining sediment overwash records 
that fall within the same distinct five regions back to aD 500. The regions and 
associated records are: 

(1) New England (two records): Mattapoisett Marsh, Massachusetts, 250 Bc to 
present, latitude 41° 39’ 8'’ N, longitude 70° 47’ 13’’ W (ref. 15); and Succotash 
Marsh, Rhode Island, ap 1300 to present, 41° 22' 45'’ N, 71°31’ 17'' W (ref. 14) 
(2) the Mid-Atlantic (three records): Alder Island, New York, 238 Bc to present, 
40° 35'54''N, 73° 34’ 45'' W (ref. 8); Brigantine, New Jersey, AD 600 to present, 
39° 25' 14" N, 74° 21' 11’' W (ref. 13); and Whale Beach, New Jersey, AD 1300 to 
present, 39° 11’ N, 74° 40’ 17’' W (ref. 12) 

(3) the southeast US Atlantic Coast: Singleton Swash, South Carolina, AD 223 to 
present, 33° 46’ N; 78° 47’ W (ref. 11) 

(4) the Gulf Coast: Western Lake, Florida, 1726 Bc to present, 30° 19’ 38’' N; 
86° 08'55"" W (ref. 11) 

(5) the Caribbean: Laguna Playa Grande Lake, Vieques, Puerto Rico, 3461 Bc to 
present, 18° 5’ 31'’ N; 65° 31’ 3"’ W (refs 6, 9 and 10) 

As discussed in the main article, potential biases have been noted for some of the 
records used, including the Western Lake record’ and the Singleton Swash 
record”. For the Singleton Swash record, we made an explicit attempt to deal 
with certain known problems as discussed in the ‘Processing of individual sedi- 
ment records’ section below. 

Processing of individual sediment records. The three oyster-bed termination 
events (AD 223, AD 652, and AD 1283) from the Singleton Swash, South Carolina, 
record were not included in our analysis because of the uncertainty regarding the 
attribution for oyster reef terminations to hurricanes'!”*, as well as potentially 
significant and undefined age uncertainties related to reservoir effects for the 
dated oyster shell material. In the absence of such a correction, the chronologies 
of these events do not meet the age control required for our analyses. 

We adjusted the Vieques, Puerto Rico, record, on the basis of the likelihood of 
an artificial trend in sensitivity of hurricane landfall at the site owing to a sub- 
stantial increase in sedimentation rates since AD 1700 (refs 9, 10). Following ref. 9, 
we assumed that changes in sedimentation rates result in an undercounting of 
approximately 11% after 1700 aD compared to 32% before AD 1700, and so we 
down-weighted the post-ap 1700 Vieques record by a factor of 11%/32% (that is, 
approximately one-third). Alternatively, we performed an analysis in which the 
Vieques record was not used subsequent to AD1700 (see Supplementary 
Information). 

Formation of regional sediment composite series. In the process of forming 
regional composites, we attempted to eliminate redundant representation of 
unique events among multiple contributing records. Multiple events among 
contributing sites within a region that fell within the 1o age model uncertainties 
of each other were consolidated to represent a single assumed landfall event. This 
decision was motivated by the fact that known landfall events falling within the 
modern historical observational period are often recorded at more than one site 
within an identified region. The date and lo ranges for the consolidated events 
were defined as the average of the dates and ranges of the contributing events. 
When multiple events from one site fell within the age model uncertainties of an 
event from another site, the consolidation was done for the event for the first site 
that was closest in nominal age to that of the second site. 

Formation of a basin-wide sediment composite series. To estimate basin-wide 
hurricane activity from the available regional composites, we normalized each 
regional event composite by the number of events in that composite, and then 
weighted the normalized sequence of events by the estimated modern return 
period for that region (see ‘Estimation of return periods’ section below). This 
process ensures that each site contributes to the estimated basin average in 
proportion to the modern frequency of landfalling hurricanes for that region, 
simulating the process by which any underlying basin-wide activity is expressed 
in terms of regional landfall activity. The results of the analysis were not espe- 
cially sensitive, however, to whether or not the data were normalized, as shown in 
Supplementary Fig. 1. The basin-wide composite was considered as terminating 
in the year of the last recorded event. The latest such year in any of the chro- 
nologies was 1991. 

Monte Carlo ensembles. A nominal chronology of basin-wide landfalling 
hurricane occurrences is defined by the weighted composite as described in 
the ‘Formation of a basin-wide sediment composite series’ section above. 
However, this nominal chronology does not take into account age model un- 
certainties in the chronologies. To take into account the impact of age model 
uncertainties, we performed Monte Carlo experiments using ensembles of 2000 
realizations in which the individual events in the regional chronologies were 
randomly perturbed within their estimated +1o radiocarbon age model un- 
certainties. We then defined a probabilistic time series of basin-wide landfalling 


nature 


hurricane rates as the maximum values over this ensemble, that is, the time 
sequence of the maximum occurrence rates for each year that are consistent with 
the event chronologies and their uncertainties. 

Estimation of return periods. We estimated landfall return periods for weight- 
ing our sites on the basis of estimated return frequencies for storms striking 
within a 270-km radius of the site, obtained by the HURISK statistical modelling 
described in ref. 19. The radius of 270 km was chosen in order to have a large 
enough area for obtaining appropriate statistics using the HURISK model, yet 
small enough that the return periods reflect the relative activity at a site com- 
pared to the others within the composite. The radius of hurricane impact for 
each site is probably less than 270km, and therefore the return periods for 
overwash at each site would probably be longer than those predicted using this 
radius. However, these derived return periods are used only to obtain relative 
weights when assimilating the different records, with actual return frequencies 
determined by the reconstructions themselves. 

It is likely that the various sites differ in the category of storm they are sensitive 
to. For example, the Alder Island and Mattapoisett sites are probably recording 
storms of category 2 or greater*!’. Although there are no modern deposits at 
Western Lake, Florida, making it difficult to assess the exact sensitivity of the site, 
ref. 7 estimates a likely sensitivity to a storm of category 4 or greater. The 
remaining sites are probably sensitive to storms of category 3 or greater. 

The return periods for storms at each respective site based on the HURISK model 
given the above-assumed sensitivities are as follows (parentheses indicate the 5% 
and 95% uncertainties in years). (1) Mattapoisett, Massachusetts (=category 2): 
8.52 years (6.31, 13.09); (2) Alder Island, New York (=category 2): 10.15 years 
(7.35, 16.43); (3) Singleton Swash, South Carolina (=category 3): 7.84 years 
(5.90, 11.69); (4) Western Lake, Florida (=category 4): 45.66 years (27.69, 
129.98); (5) Vieques, Puerto Rico(=category 3): 5.37 years (4.22, 7.39). 

All distinct events were included in regional composites, so the chronologies 
of the regional composites are dominated by the individual sites with the largest 
number of contributing events and shortest return periods. Return periods for 
the regional composites were therefore defined by the most active site contri- 
buting to the composite. The mid-Atlantic composite was accordingly assigned 
the Alder Island return period, while the New England composite was assigned 
the Mattapoissett return period. 

For comparison, we also examined the case where the HURISK return periods 
used for weighting were instead assigned based on an assumption of uniform 
sensitivity to a major hurricane (that is, a storm of category 3 or greater) passing 
within 270 km of each site. (1) Mattapoisett, Massachusetts: 23.81 years (15.5, 
51.3); (2) Alder Island, New York: 25.7l years (16.25, 61.45); (3) Singleton 
Swash, South Carolina: 7.84 years (5.90, 11.69); (4) Western Lake, Florida: 
8.78 years (6.58, 13.16); (5) Vieques, Puerto Rico: 5.37 years (4.22, 7.39). 

Alternative results from those shown in the main article based on using these 
latter landfall return period estimates are provided in Supplementary 
Information. For completeness, we also considered the extreme (and rather 
implausible) case where all sites are assumed to have equal return periods 
(Supplementary Information). In all cases, the basic features of the basin-averaged 
record are preserved (for example, the elevated activity during the interval 
AD 900-1100), but the detailed evolution differs. 

Finally, an assessment was made of the robustness of the basin-wide composite 
with respect to the contributions of each of the five distinct regions using a 
traditional jackknife analysis wherein each of the five regions were eliminated 
one-by-one, and composites were performed using only the four remaining 
regions. The resulting five jackknife surrogates are shown in the 
Supplementary Information. The spread among the five jackknife surrogates 
defines the standard errors shown in Fig. 3 for the sediment composite record. 
Modern calibration and validation of statistical model. The statistical model 
was trained over the full (1870-2006) 137-year interval of overlap between the 
available instrumental climate state variables and historical tropical cyclone count 
record as in ref. 3, and the same split calibration/validation procedure, wherein 
the model was alternatively calibrated and validated over the half-intervals 1870- 
1938 and 1939-2006, was used. The same instrumental data products were used as 
in ref. 3, including blended HadCRU/ERSST/Kaplan*!*’ instrumental SST 
products for the Aug-Oct MDR SST and Dec-Feb Nifio indices, and the CRU 
(ref. 34) Dec—-Mar NAO series. We note that the Nifio3 index of ENSO (rather 
than the Nifio3.4 index favoured in ref. 3) was used, because a palaeoclimate 
reconstruction is available only for the Nifio3 index and not for the Nifio3.4 index. 
As noted in ref. 3, however, which Nijio index is used has very little influence on 
the resulting statistical model (statistical model resolved variance is 45%/41% for 
full calibration/validation, as compared with 50%/43% in ref. 3). For sake of 
comparison, the instrument record-based statistical model was extended back 
in time from 1870 to AD 1851 using the longer-term instrumental data provided 
by refs 17 and 18, as shown in Fig. 2. 
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As discussed in the main text, the historical tropical cyclone record was 
corrected for an estimated average undercount of 1.2 storms before aircraft 
reconnaissance began (pre-1944) as in ref. 3, but the conclusions of our study 
are insensitive to whether this estimate, or the more sizeable undercount bias 
argued by Landsea et al. (see refs 2, 29) is used (Supplementary Information). 
Additional tests performed elsewhere*’ have used an alternative series we term 
the ‘MDR residual’ (MDR SST minus the global tropical mean SST during Aug— 
Oct), which has been argued in certain studies**’’ to be a better predictor of 
tropical Atlantic tropical cyclone counts than the MDR SST itself. These tests 
reveal the MDR residual to be an inferior predictor to MDR SST across all 
reconstruction skill metrics; in particular, the statistical model cannot reproduce 
the positive trend of the past two decades when the MDR residual is used as a 
predictor instead of MDR SST. 

The statistical model was examined for adequacy with respect to regression 
assumptions (that is, that the assumption of Poisson-distributed regression 
residuals is met), based on ¢ and likelihood-ratio tests. The statistical model 
trained over the full 137-year period has a residual deviance of D = 107.98 with 
N= 133 degrees of freedom. A ¢ test indicates « = 0.945 (that is, a 95% chance 
that we would be incorrect in rejecting the null hypothesis of Poisson-distributed 
residuals). The null deviance (that is, the residual deviance for an assumed fixed- 
rate Poisson process) is D= 196.91 with N= 136 degrees of freedom. A like- 
lihood test based on the difference AD = 88.93 with N= 3 degrees of freedom 
indicates a statistical significance of P = 0.0 for the statistical model itself (that is, 
a 0% chance that we would be incorrect in rejecting the hypothesis that the model 
coefficients for the three predictors are all zero). 

Statistical prediction of tropical cyclone counts using proxy reconstructions. 
Here the model was applied to decadally resolved reconstructions of MDR SST 
and Nifio3 described by ref. 17 and the decadally smoothed winter NAO index of 
ref. 18. For the instrumental interval (1851 to present), standard errors due to 
uncertainties in the model coefficients were calculated from the residual decadal 
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variance diagnosed from the validation residuals (standard errors were averaged 
for the early and late intervals of the split calibration/validation procedure). For 
the pre-1851 statistical model estimates, which are driven by reconstructed 
climate indices, there is an additional component of uncertainty due to the 
uncertainties in the climate indices themselves. This contribution was estimated 
by Monte Carlo simulations in which the statistical model was driven with an 
ensemble of 2000 randomly perturbed versions of the statistical predictors con- 
sistent with their estimated uncertainties!’, and an additional random term due 
to the uncertainties in the model coefficients. 

Finally, to determine the separate roles of the individual predictors, we 
performed statistical model runs in which each of the predictors (NAO, MDR 
SST and Nifio3) was kept constant at its modern climatological mean value, while 
the other two predictors were allowed to vary (Supplementary Information). 
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Glacial effects limiting mountain height 


D. L. Egholm’, S. B. Nielsen’, V. K. Pedersen’ & J.-E. Lesemann’ 


The height of mountain ranges reflects the balance between 
tectonic rock uplift, crustal strength and surface denudation. 
Tectonic deformation and surface denudation are interdependent, 
however, and feedback mechanisms—in particular, the potential 
link to climate—are subjects of intense debate’. Spatial variations 
in fluvial denudation rate caused by precipitation gradients are 
known to provide first-order controls on mountain range width, 
crustal deformation rates and rock uplift**. Moreover, limits to 
crustal strength’ are thought to constrain the maximum elevation 
of large continental plateaus, such as those in Tibet and the central 
Andes. There are indications that the general height of mountain 
ranges is also directly influenced by the extent of glaciation 
through an efficient denudation mechanism known as the glacial 
buzzsaw*”. Here we use a global analysis of topography and show 
that variations in maximum mountain height correlate closely 
with climate-controlled gradients in snowline altitude for many 
high mountain ranges across orogenic ages and tectonic styles. 
With the aid of a numerical model, we further demonstrate how 
a combination of erosional destruction of topography above the 
snowline by glacier-sliding and commensurate isostatic landscape 
uplift caused by erosional unloading can explain observations of 
maximum mountain height by driving elevations towards an alti- 
tude window just below the snowline. The model thereby self- 
consistently produces the hypsometric signature of the glacial 
buzzsaw, and suggests that differences in the height of mountain 
ranges mainly reflect variations in local climate rather than 
tectonic forces. 

Distinctive alpine landforms, such as broad ‘U’-shaped, flat-floored 
valleys, hanging valleys, cirques, horns and knife-edged ridges (arétes), 
are considered the fingerprints of glacial erosion. These glacial land- 
forms exist in most of Earth’s mountain chains, produced by present 
and past glaciers. A majority of these landforms are associated with 
pronounced topographic relief consisting of over-steepened valley 
sides, headwalls and near-orthogonal tributary junctions often 
occupied by spectacular waterfalls. Perhaps not surprisingly, glacia- 
tions have accordingly been assumed to increase average relief mainly 
by incising valley systems, leaving high elevation peaks and hillslopes 
almost unaffected, and producing significant isostatically driven peak 
uplift’®. 

However, it has recently been discovered that glaciated orogens in 
the Himalayas®, the Andes’, the Sierra Nevada (USA)' and the 
Cascade Range® hold a striking coincidence of snowline altitudes, 
glacier equilibrium line altitudes (ELA) and elevations with a high 
proportion of surface area, suggesting that operation of a glacial 
buzzsaw denudation mechanism may be effective in reducing surface 
topography above the snowline and concentrating it at the snowline. 

Apatite “He/*He ratio and (U-Th)/He thermochronometry studies 
from British Columbia’ and the St Elias orogen in Alaska’’ support 
the notion of rapid glacial erosion at altitudes near and above the 
snowline, and indicate that climatically controlled snowline lowering 
dramatically increased average erosion rates during the late Cenozoic. 


The geomorphic signature of the glacial buzzsaw is a concentration 
of surface area at elevations corresponding to the glacial ELA or the 
snowline®*"', which roughly coincide for temperate glaciers". In hyp- 
sometric distributions (Fig. 1a), this reveals itself as a local maximum 
(representing a high proportion of surface area) at an altitude corres- 
ponding to the snowline®'’. To explore the prevalence of the glacial 
buzzsaw, we thus analysed the global distribution of hypsometric 
maxima (Fig. 1b) and surface area (Fig. 1c) using the Shuttle Radar 
Topography Mission’® (SRTM) digital elevation models (DEMs) and 
compared this with observations of modern and Last Glacial 
Maximum (LGM) snowline altitudes'”!*. We focus on the latitude 
dependence because the variation in snowline altitude is larger 
with latitude than with longitude (or with time); see Supplementary 
Figs 1-4 for more details of the analysis. 

The analysis first reveals how only little surface area and practically 
no hypsometric maxima occur at elevations above the local modern 
snowline altitude (Fig. 1b and c). Second, below the snowline, surface 
area is concentrated in readily recognizable tectonically uplifted pla- 
teaus and near sea level, where alluvial plains increasingly dominate 
as they grade to sea level (Fig. 1c). The modern snowline seems to 
closely follow the 107° contour of normalized surface area (the blue 
end of the colour scale in Fig. 1c), emphasizing its influence on high- 
altitude hypsometry. 

When a DEM tile has topography above the snowline, its highest 
hypsometric maximum generally exists between the modern and LGM 
snowlines (Fig. 2). This pattern is recognized globally in every 
mountain range with sufficient height to intersect the LGM snowline 
(Supplementary Fig. 4). Hence, the abundance of hypsometric maxima 
just below the modern snowline and the absence of hypsometric 
maxima above the snowline are largely independent of tectonic uplift 
rate, lithology and general tectonic setting. For example, the Himalayas 
are in an intra-continental setting caused by continent—continent 
collision between India and Asia, whereas the high topography in 
western North and South America and in New Zealand is adjacent 
to oceans and caused by subduction and related volcanism. Yet, all 
show similar correlation between the distribution of hypsometric 
maxima and the local snowline altitude. 

It further appears that glacial erosion controls the maximum 
height of mountains, as most summit elevations are confined to 
altitudes <1,500 m above the local snowline (Fig. 2). Although some 
of the highest peaks are evidently not included in the analysis owing 
to data gaps, this general trend suggests that glacial erosion restricts 
the height of mountains by limiting the relief that can be maintained 
between mountain peaks and the snowline’. The recorded outliers 
(Fig. 2) represent solitary peaks of primarily volcanic origin and most 
exist in the Andes, implying that formation and ongoing build-up of 
high volcanoes can outpace glacial buzzsaw denudation. Notable 
exceptions to the trend also exist in the Transantarctic Mountains 
outside the data set where a reduction or even non-existence of the 
glacial buzzsaw mechanism is likely, as non-eroding polar glaciers 
(frozen at the bed) dominate where the ice cover is relatively thin’. 
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Figure 1| Global prevalence of the glacial buzzsaw. a, Hypsometric 
distributions from a fluvial (northern Andes) and a glacial (Cascade Range, 
USA) landscape. Both hypsometries contain local maxima, but at different 
elevations. The hypsometric distributions each represent a 1° X 1° DEM 
with coordinates (lower left corner) (09° N, 71° W) for the fluvial landscape 
and (48° N, 121° W) for the glacial. b, Global distribution of hypsometric 
maxima as function of latitude and elevation. Each hypsometric maximum 
found in the data set is represented by a circle coloured by its DEM database 
entry (upper left inset). Hypsometric maxima derived from DEMs with more 
than 5% data gaps are white. The grey shaded area represents the total data 
coverage set by the maximum elevation at each latitude. Also shown are the 


Generally, the presence of a hypsometric maximum indicates that 
dynamic processes (tectonic, erosional or depositional) concentrate 
surface elevation within a narrow altitude interval. Thus, the high 
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Figure 2 | Maximum elevations and hypsometric maxima elevations 
correlate with local snowline altitudes. Shown are all maximum DEM 
elevations (peaks) above the modern snowline and their associated highest 
hypsometric maxima. Most peaks are less than 1,500 m above the snowline, 
and apart from few exceptions all hypsometric maxima are found below the 
modern snowline and less consistently above the LGM snowline (the two 
snowlines largely demarcate the amplitude of the late Cenozoic climate 
variations). At latitudes >27° N, local snowline observations'’® (latitude- and 
longitude-dependent) are used, whereas the east Pacific snowline transects'” 
are used independent of longitude for DEMs with latitude <27°N. 
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modern (black line) and LGM (dashed line) snowline transects along the east 
Pacific (EP) coasts of North and South America” and more than 13,000 
modern snowline observations from the World Glacier Inventory"* averaged 
in bins of 1° latitude where available (red line). ¢, Distribution of surface area 
(A) based on latitude (L) and elevation (E) within the data set. Colours show 
area at a given latitude and elevation normalized by the total area (Ajota)) of 
the data set. Note the logarithmic colour scale. The grey zone has very little 
area and falls outside the colour scale, but serves to outline the total data 
range. Ellipses mark tectonically uplifted plateaus: 1, South Africa; 2, the 
Altiplano (Andes), 3, the Tibetan plateau; and 4, the Tarim basin. (See also 
Supplementary Figs 1-4.) 


density of hypsometric maxima at low elevation (Fig. 1b and 
Supplementary Fig. 2) reflects, among other processes, fluvial erosion 
and deposition controlled by a base level ultimately set by the sea. 
Following this reasoning, and the findings of others®*'''*, the 
abundance of hypsometric maxima associated with snowline alti- 
tudes and the general absence of hypsometric maxima above the 
snowline (Fig. 1) likewise suggest that glaciers concentrate surface 
area just below the snowline, which thereby acts like a local base level 
for glacial erosion, leaving only a limited amount of topography 
(horns and arétes) rising above this level® (Fig. 1c). In large trunk 
valleys and fjords, glacial erosion is, however, known to reach far 
below the snowline”’. 

This erosional pattern is confirmed by computational model 
experiments coupling the flow equations of ice with glacial erosion 
driven by sliding of warm-based glaciers. As an example illustrating 
the imprint of glacial erosion, we simulate the evolution of an initially 
fluvial landscape during a dramatic lowering of the snowline altitude. 
The Sierra Nevada mountain range in southern Spain (37° N, 3° W) 
is used as the initial landscape as it was little affected by Quaternary 
glaciations, and only few cirque valleys near the highest summit areas 
above 2,500m bear testimony to the past appearance of alpine 
glaciers. The landscape is thus largely fluvial (Fig. 3a), with relatively 
narrow river valleys emanating from the 60-km-long central ridge 
running broadly east-west. Accordingly, the hypsometric distri- 
bution derived from SRTM data’® shows gradual decrease of surface 
area with increasing height above the 1,300 m contour line and with 
no indication of a local maximum (Fig. 3b). 

To fully isolate the effects of glacial erosion and isostasy, the model 
does not include tectonic uplift, fluvial erosion or periglacial pro- 
cesses. In order to prevent slopes from steepening unrealistically, the 
model does, however, include nonlinear hillslope diffusion simulat- 
ing mass wasting on the steepest slopes”’. In the model, the snowline 
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Figure 3 | Numerical model of glacial erosion in the Sierra Nevada (Spain). 
a, Initial fluvial topography derived from 90 m SRTM data’’. The thick black 
contour line represents the model snowline altitude after 100 kyr, while the 
white contour line shows the extent of glaciers at time 100 kyr. 

b, Hypsometry before and after glacial erosion. A hypsometric maximum 
develops owing to denudation of hillslopes and small valleys above the 
snowline combined with isostatic uplift of unglaciated surfaces below the 
snowline. To some degree, the deep erosion of trunk valleys opposes the 
development of a hypsometric maximum by lowering their surface area to 


altitude is reduced from 3,000 m to 2,000 m (comparable to a 20° 
northward shift and conditions as found at present in, for example, 
British Columbia, Canada) over a period of 100 kyr, and in this stage 
the total ice volume grows until a quasi-steady state balance between 
total glacier accumulation and ablation is established. This leads into 
a longer phase (400 kyr) of glacial erosion and isostatic compensa- 
tion. Although isostasy causes peak uplift’, the ice volume decreases 
owing to overall erosional destruction of catchment area above the 
snowline. Hence, after 500 kyr the total ice volume is reduced to 20% 
of its maximum value (Supplementary Fig. 5). 

In agreement with the glacial buzzsaw hypothesis, this model 
experiment reveals (Fig. 3b) how snowline lowering self-consistently 
leads to emergence of a hypsometric maximum just below the snow- 
line owing to erosion of surface topography at, and above, the snow- 
line and compensatory isostatic uplift affecting the entire landscape, 
including unglaciated hillslopes at lower altitudes. 

The model results also highlight the variable influence of topo- 
graphy on ice flow. Trunk valley glaciers initially reach far below the 
snowline and cause deep valley incision. Their large catchments allow 
for sufficient ice flux below the snowline and erosion along most 
portions of the glacier in this region. The largest tributary catchments 
undergo more limited erosion than trunk valleys and develop hang- 
ing valleys, which often exhibit cirques near their headwalls (Fig. 3c 
and d). In smaller tributaries and on hillslopes, the key control on 
erosion appears to be the limited catchment area, which does not 
allow for ice influx to exceed ablation below the snowline and, in 
effect, the snowline acts like a climatic base level limiting the down- 
valley extent of glacial erosion. Glaciers with small catchment areas 
are thus fundamentally restricted by the snowline and unable to 
incise the landscape much below this level. Instead, they slowly erode 
headward into the landscape, leaving behind low-relief surfaces 
(Fig. 3d) at an altitude just below the snowline: the signature of the 
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elevations far below the snowline. c, Model topography after glacial erosion 
to a stage where only narrow and steep ridges (arétes) remain above the 
snowline. Valleys are widened and deepened below the snowline. The 
amplitudes of isostatic rock uplift are shown by dashed contour lines. 

d, Enlarged view of a catchment exhibiting classical glacial erosion features 
including arétes (A), cirques (C), hanging valleys (H), and low relief areas (L) 
produced by the glacial buzzsaw. For more information on model results, see 
Supplementary Figs 5-10. 


glacial buzzsaw. These low-relief surfaces are, however, separated by 
pronounced overdeepening in trunk valleys. 

Isostatic uplift varies from ~200 m in the centre of the mountain 
range to 20 m in the upper right corner of the model (Fig. 3c). Thus, 
surface elevation increases in areas of the model where erosion rates 
are low. This applies to the highest arétes and peaks as well as the 
lower parts of the hillslopes, which before glacial erosion were at 
altitudes below the snowline and therefore unglaciated. When the 
latter undergo isostatic uplift, some eventually pass the snowline and 
experience modification by glacial erosion as described above and 
further add to the hypsometric maxima. 

In addition to classical features of alpine glaciation (“U-shaped 
valleys, hanging valleys, overdeepenings, ridges and cirques; Fig. 3d), 
our model thus predicts that alpine glacial landscapes will naturally 
undergo strong erosion at and above the snowline. This erosion and 
its coupling with flexural isostasy produce the high proportion of 
surface area below the snowline, in agreement with global hypsomet- 
ric observations. The concentration of surface area occurs in the 
temperature window (~0°C) where periglacial processes (not 
included in the present model) are most effective. Thus, low-relief 
landscapes in the vicinity of the snowline will probably be further 
developed by frost cracking and frost creep”. This will effectively 
plane off and further flatten remnant short-wavelength topography 
into smooth low-curvature surfaces similar to those documented in 
the Laramide ranges of the western United States**”>. 

Active tectonic uplift counteracts the erosional destruction of 
topography above the snowline or balances it’*, but the evidence 
presented here and the results of other studies®*''’ indicate that 
glacial buzzsaw erosion is capable of keeping pace with any modern 
tectonic uplift rate. Therefore, the relationship between tectonism 
and climate is such that climate and latitude ultimately control the 
height to which tectonic processes can drive topography®’. 
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METHODS SUMMARY 


The SRTM data set covers all land area within 120°of latitude (60° S to 60° N). 
Each of the 14,546 DEMs samples an area of 1° X 1° with 3-arcsec resolution. For 
every DEM, we calculated the hypsometric distribution using 40 bins uniformly 
spaced between the DEM’s minimum and maximum values, and systematically 
sampled the elevation(s) to any existing local maximum (see Supplementary 
Figs 1-4 for more details on the topographic analysis). 

In the numerical model, the general treatment of ice motion and glacial 
erosion closely resembles other two- and three-dimensional models”” of glacial 
landscape evolution. The numerical model is however based on the second-order 
shallow ice approximation” (SOSIA), which allows for simulating long-term 
glacier evolution at a higher resolution by accounting for important higher- 
order effects related to a rugged basal topography, rapid ice flow accelerations, 
and large gradients in longitudinal stress and ice thickness. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Overview. The present numerical landscape evolution model serves to demon- 
strate how the glacial buzzsaw denudation mechanism inherently follows from 
already established assumptions on glacier dynamics and glacial erosion. 
Compared to existing modelling studies*'*’, there are no new assumptions. 
The governing equations are, however, solved using a novel approach 
(Supplementary Information). This allows prediction of three-dimensional 
patterns of glacial erosion on a much finer scale than previously because it con- 
siders the effects of steep bed topography and gradients in horizontal stress within 
the ice. Importantly, this permits modelled erosion patterns to be analysed at the 
scale of the well-known glacial landforms (hanging valleys, cirque valleys, and so 
on). 

The mechanism of glacial buzzsaw erosion is very robust when varying model 

parameters controlling, for example, ice flow velocities, erosion rates and sub- 
glacial water pressure. This is because the buzzsaw signal arises primarily because 
of ablation patterns caused by the atmospheric temperature gradient. In other 
words, the downward restriction of small glaciers does not depend on the exact 
nature of the ice flow, but on the fact that small glaciers quickly melt when 
passing the snowline altitude. 
Modelling ice flow and glacial erosion. Ice motion is the sum of internal 
deformation and basal sliding. The internal deformation is governed by Glen’s 
flow law, ¢ = Ate Tip relating components of the strain rate tensor, é;, to com- 
ponents of the deviatoric stress tensor, tj. Te is the effective stress** and 
A=2.25X 10 '’Pa *yr~! is the Arrhenius constant for ice at temperatures 
around —10 °C (ref. 37). 

The sliding velocity is assumed aligned with the basal shear stress, t,, and of 
magnitude given by the generalized Weertman law™: u, = C,|t5|""/(pi — Pw)”. 
Here C,=2X10 °mPa “yr ' isa sliding coefficient, p; is the ice overburden 
pressure normal to the bed and p, is the meltwater pressure at the bed. We follow 
others*’*° and assume a constant ratio between melt water and overburden 
pressures, py = 0.2p; and that m= 3 and n= 1. Although there is some experi- 
mental support*”“° for the latter choice, m and n are poorly constrained para- 
meters. We have, therefore, explored the sensitivity of the modelling results to 
variations in m and n, and in agreement with ref. 41 we find that changing the 
parameters mainly affects the shape of valleys and not our conclusions regarding 
erosional patterns of the buzzsaw denudation mechanism. 

Mass balance is calculated as the sum of accumulation, M,, surface ablation, M,, 
and basal melting, M,, all in units of myr | ; M,=-—0.1min(0,T,), 
M, = —0.15max(0,T,) and M, = —0.05max(T,,,T). Here T; (in units of °C), 
given by Tp — 5h, is the ice surface temperature at altitude h (in units of km); 
Tp is basal temperature as predicted from one-dimensional temperature profiles 
including conduction and vertical advection of heat*’; T,, = —0.00087H is the 
melting point of ice (His ice thickness in units of m). To is reduced from 15 °C to 
10°C during the initial 100 kyr. The downward migration of the snowline allows 
wet-based glacial erosion to affect, at varying degrees and different times, all 
altitudes in the landscape above the final snowline altitude. This ensures that 
the highest peaks and arétes (which after 100 kyr are in a temperature window 
with non-eroding cold-based ice) have experienced glacial erosion and gives them 
a realistic glacial appearance. The slow downwards migration of the snowline can 
be understood as (1) aslowand long-term average temperature decline (averaging 
shorter term second order variations) or (2) tectonic uplift that transports topo- 
graphy slowly upwards through the snowline altitude. 

We assume that the local glacial erosion rate normal to the bed is primarily 
governed by abrasion and quarrying, and hence**’, proportional to local sliding 
rates: e = k.|us|. ke is calibrated to 1 X 10° * by requiring average erosion rates to 
be less than 1 mmyr ', which is a conservative estimate’’. Depending on the 
altitude and the thickness of the ice, the model glaciers above the snowline can be 


nature 


frozen to the bed. The basal sliding rate is then zero and ice transport is solely due 
to internal deformation. 

The Stokes mechanical equilibrium equations are solved using the second- 
order shallow ice approximation*® (SOSIA) and a finite volume formulation (see 
Supplementary Information for details) with an irregular grid and explicit time 
integration. The higher-order method allows for solving the ice flow equations 
accurately, also when basal topography is steep and when significant gradients in 
longitudinal and transverse stress are present. The irregular grid consists of 
31,480 Voronoi cells with average spacing of 400 m. 

Modelling hillslope processes. In order to prevent surface slopes from increas- 
ing unrealistically, we include mass wasting on steep slopes using a nonlinear 
diffusion model***’. The model expresses the flux of material, qg = —Ks/[1 — 
(s/s.)*], as a function of surface slope, s. k = 1m/’yr™! is the diffusivity of low 
gradient hillslopes, while s, = 1.0 defines a critical slope near which the effective 
diffusivity increases towards infinity. 

Modelling flexural isostasy. Flexural isostasy is incorporated by coupling the 
erosion model to a two-dimensional uniform thin elastic plate** with an elastic 
thickness of 10 km. The densities of the eroding crust and the asthenosphere are 
2,800kgm~* and 3,300kgm °, respectively. Erosional unloading and the 
weight of the ice cause deflections of the plate, which are fed back into the model 
as vertical surface movements. The relatively small size of the Sierra Nevada 
mountain range combined with the elastic thickness of 10 km effectively limits 
the amplitudes of isostatic uplift following erosion. The model presented is thus 
conservative with respect to how isostasy contributes to the mechanism of the 
glacial buzzsaw. 
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Pelvic claspers confirm chondrichthyan-like internal 


fertilization in arthrodires 


Per Ahlberg’, Kate Trinajstic?**, Zerina Johanson® & John Long®”® 


Recent finds'” demonstrate that internal fertilization and viviparity 
(live birth) were more widespread in the Placodermi, an extinct 
group of armoured fishes, than was previously realized. 
Placoderms represent the sister group of the crown group jawed 
vertebrates (Gnathostomata)**, making their mode(s) of reproduc- 
tion potentially informative about primitive gnathostome condi- 
tions. An ossified pelvic fin basipterygium discovered in the 
arthrodire Incisoscutum ritchiei was hypothesized to be identical 
in males and females, with males presumed to have an additional 
cartilaginous element or series forming a clasper. Here we report the 
discovery of a completely ossified pelvic clasper in Incisoscutum 
ritchiei (WAM 03.3.28) which shows that this interpretation was 
incorrect: the basipterygium described previously’ is in fact unique 
to females. The male clasper is a slender rod attached to a square 
basal plate that articulates directly with the pelvis. It carries a small 
cap of dermal bone covered in denticles and small hooks that may be 
homologous with the much larger dermal component of the 
ptyctodont clasper. 

The presence of embryos within an adult female of the placoderm 
Incisoscutum ritchie, a member of the Arthrodira, was recently 
described'. This demonstrated that live birth and internal fertilization 
among placoderms are not restricted to the Ptyctodontida, where 
embryos have been described in Materpiscis attenboroughi and 
Austroptyctodus gardiner?. Ptyctodont males possess modified pelvic 
elements, or claspers”®, used in living male chondrichthyans for 
internal fertilization. The existence of a pelvic clasper in Incisoscutum 
ritchici, hypothesized previously’, can now be confirmed in WAM 
03.3.28 and demonstrates sexual dimorphism in an arthrodire for 
the first time (Fig. la, b). The clasper, previously misidentified as the 
left pelvis of this individual, is composed of perichondral bone and 
articulates directly with the pelvic girdle (Fig. 1b), unlike sharks where 
the clasper articulates with a more proximal basipterygial element. The 
distal end of the clasper is thin and rod-like. Its tip is covered by a small, 
distinct cap of dermal bone, ornamented with small pits and denticles. 
The proximal part of the clasper is expanded into a plate pierced by 
four foramina, two larger above two smaller (Fig. 1b). The anatomy of 
the pelvic clasper most probably corresponds to the core of an erectile 
element, as in extant sharks. The basal element previously described’ 
in Incisoscutum ritchiei (WAM 04.10.02) is reinterpreted as the 
female basipterygium, rather than a proximal element common to 
both sexes. This female element is also seen in BMNH P50934, one 
of the specimens containing embryos. In both ptyctodonts and 
Incisoscutum the distal ends of the claspers are capped by a covering 
of dermal bone. However, in ptyctodonts the dermal bone covering 
is extensive and so far no internal element has been discovered’, 


suggesting that the endoskeleton was unossified, whereas in 
Incisoscutum the internal skeleton is perichondrally ossified and only 
the distal tip is covered by dermal bone. Notwithstanding these differ- 
ences of emphasis between the dermal and endoskeletal components, 
the claspers appear to share a common architecture that suggests 
homology between the two groups. We tentatively infer that claspers 
characterize the whole of the Ptyctodontida and Arthrodira; further 


Pelvic girdle 


a 


Clasping region 


Figure 1| The male pelvic girdle of Incisoscutum ritchiei WAM 03.3.28. 
a, Pelvic girdle in dorsal view; b, pelvic girdle restored. 
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study of well-preserved arthrodires should be undertaken to test this 
hypothesis. 
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Homothallic and heterothallic mating in the 
opportunistic pathogen Candida albicans 


Kevin Alby’, Dana Schaefer’ & Richard J. Bennett! 


Candida albicans is the most common fungal pathogen in humans, 
causing both debilitating mucosal infections and potentially life- 
threatening systemic infections”. Until recently, C. albicans was 
thought to be strictly asexual, existing only as an obligate diploid. 
A cryptic mating cycle has since been uncovered in which diploid 
a and a cells undergo efficient cell and nuclear fusion, resulting in 
tetraploid a/a mating products**. Whereas mating between a and 
a cells has been established (heterothallism), we report here two 
pathways for same-sex mating (homothallism) in C. albicans. 
First, unisexual populations of a cells were found to undergo 
autocrine pheromone signalling and same-sex mating in the absence 
of the Barl protease. In both C. albicans and Saccharomyces cerevi- 
siae, Bar] is produced by a cells and inactivates mating pheromone a, 
typically secreted by a cells’”°. C. albicans Abar1 a cells were shown 
to secrete both a and a mating pheromones; o-pheromone activated 
self-mating in these cells in a process dependent on Ste2, the receptor 
for a-pheromone. In addition, pheromone production by a cells was 
found to promote same-sex mating between wild-type a cells. These 
results establish that homothallic mating can occur in C. albicans, 
revealing the potential for genetic exchange even within unisexual 
populations of the organism. Furthermore, Barl protease has an 
unexpected but pivotal role in determining whether sexual re- 
production can potentially be homothallic or is exclusively hetero- 
thallic. These findings also have implications for the mode of sexual 
reproduction in related species that propagate unisexually, and indi- 
cate a role for specialized sexual cycles in the survival and adaptation 
of pathogenic fungi. 

The hemiascomycetous fungi are a diverse range of species that 
include the prototypical model yeast, Saccharomyces cerevisiae, as 
well as the prevalent human pathogen, Candida albicans. Sexual 
reproduction in both fungi is driven by the secretion of sex-specific 
pheromones that promote mating between partners of the opposite 
sex''?, Pheromone gradients are sensed by G-protein-coupled cell- 
surface receptors that initiate the mating response’*’. C. albicans 
mating is thought to be exclusively heterothallic, whereas isolates 
of S. cerevisiae that express HO endonuclease are homothallic 
because of mating-type switching’®. The discovery that unisexual 
populations of C. albicans can express both a and « mating phero- 
mones” led us to test whether autocrine pheromone signalling and 
homothallism can occur in this species. 

C. albicans a cells derived from the laboratory strain SC5314 were 
treated with synthetic x-pheromone and the quantitative polymerase 
chain reaction used to compare the induction of both a- and &-phero- 
mone genes (MFa and MFu, respectively). Opaque-phase cells were 
used for these experiments, as these are the mating-competent form of 
C. albicans®. Both MFa (300-fold) and MF (373-fold) were highly 
induced in response to o-pheromone, in contrast to S. cerevisiae 
where only a-pheromone is upregulated'*. In both C. albicans and 
S. cerevisiae, an extracellular barrier activity acts to antagonize the 


function of o-pheromone. The activity, encoded by BARI, is an aspar- 
tyl protease that cleaves «-pheromone, inactivating it’"'®. We there- 
fore analysed a C.albicans Abar1 mutant to determine whether 
a-pheromone production has a functional role in these cells. 
C. albicans Abar1 cells exhibited distinctive colony morphologies 
when cultured on solid media. Mutant colonies appeared increasingly 
wrinkled during growth on yeast peptone dextrose (YPD) medium 
(Fig. la). Cells retrieved from these colonies were highly elongated and 
showed similarities to cells that had undergone extended exposure to 
pheromone’. To confirm that elongated cells were undergoing a 
mating-type response and were not hyphal (filamentous) cells, green 
fluorescent protein (GFP) reporters were introduced under the 
control of pheromone-specific promoters. Both FIG] and FUS1 genes 
are highly expressed in response to pheromone”, and pFIG1-GFP and 
pFUS1-GFP constructs were therefore used to monitor activation of 
the mating program. Significantly, both FIGI and FUS1 expression 
was induced, confirming that cells were undergoing a mating-like 
response, even in the absence of exogenous pheromone or a mating 
partner of the opposite sex (Fig. 1b). To confirm that this phenom- 
enon was not limited to strains derived from $C5314, Abar1 mutants 
were also generated in P37005, a natural a/a isolate of C. albicans, and 
found to exhibit similar cell and colony phenotypes (Supplementary 
Fig. 1). 

Two models were considered for how autocrine pheromone 
signalling could lead to activation of a mating response. First, 
a-pheromone secretion could signal to Ste3, the receptor for a- 
pheromone (Fig. 2a, model I). Alternatively, «-pheromone could 
induce mating via binding to Ste2, the receptor for a-pheromone 
(model II). Although not mutually exclusive, these two models were 
tested by construction of strains lacking different combinations of 
pheromone or pheromone receptor genes. Significantly, loss of either 
o-pheromone (MF«) or its receptor (Ste2) completely abolished the 
wrinkled colony appearance of Abar1 a strains (Fig. 2b). In contrast, 
loss of STE3 (encoding the receptor for a pheromone) had no effect 
on colony phenotype (Fig. 2b). Thus, autocrine signalling in a cells is 
driven by «-pheromone production and requires the presence of 
Ste2, the cognate receptor for a-pheromone (model II). 

Autocrine pheromone signalling was also observed between 
neighbouring colonies of cells, as Abar1 a cells induced a response 
in adjacent colonies of a cells. Cells from wrinkled colonies again 
exhibited polarized growth indicative of a mating response (data not 
shown). Neighbouring cells failed to respond in the absence of STE2, 
indicating that signalling required the receptor for «-pheromone 
(Fig. 2c, row 2). Furthermore, Abar1 cells were unable to induce a 
response in adjacent cells if either MFx or STE2 was also deleted 
(Fig. 2c, rows 3 and 4). The dependence of signalling on STE2 is 
demonstrative of a positive feedback loop in these cells; «-pheromone 
secreted by Abar] cells binds to Ste2 receptors on these cells and results 
in further augmentation of pheromone production. Only when this 
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a Wild type 


Wild type 
FIG1-—GFP 
(-) a-pheromone 


Wild type 
FIG1-GFP 
(+) o-pheromone 
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FIG1-—GFP 
(-) «-pheromone 


Wild type 
FUS1-GFP 
(-) a-pheromone 
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FUS1-GFP 
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Figure 1| Deletion of BART results in an autocrine mating response. 

a, Patches of wild-type and Abar]1 a cells after 4 days growth on YPD 
medium. b, Mating response of wild-type and Abar] a cells with and without 
a-pheromone in Spider medium for 36 h. Differential interference contrast 
(DIC, left) and GFP fluorescence (GFP, right) images are shown. Induction 
of the mating response was monitored using GFP reporters under the 
control of the mating-response genes FIG1 or FUS1. All C. albicans cells used 
in this study were in the opaque phase as this is the mating-competent form 
of the organism®*”. Scale bars, 16 jtm. 


feedback loop is in effect does pheromone secretion reach the levels 
necessary for autocrine signalling. STE3, encoding the receptor for a- 
pheromone, was not involved in this signalling (Fig. 2c, column 4 and 
row 5). Production of «-pheromone by Abar! a cells was also evident 
in halo assays and was shown to increase biofilm formation by these 
cell types (see Supplementary Information). 

The existence of a robust autocrine mating pathway led us to test 
whether C. albicans is capable of homothallic mating. Two different 
auxotrophic Abar1 strains were mixed and co-incubated on Spider 
medium for 3 days. Cells were recovered from the mating mixture 
and plated onto selective medium to detect potential a—a mating 
products. A number of prototrophic mating products were obtained 
and analysis by PCR and flow cytometry revealed that these were 
stable tetraploid cells of mating type a (Fig. 3a, b). To compare the 
frequency of homothallic and heterothallic mating in C. albicans, 
several quantitative mating crosses were performed (Fig. 3c). 
Mating between wild-type a and « cells occurred efficiently (94%), 
whereas no mating was observed between two wild-type a strains 
(<0.00002%). Significant levels of unisexual mating were obtained, 
however, using two Abar1 a strains (0.037%), an increase of more 
than 1,850-fold over wild-type a—a mating. The efficiency of uni- 
sexual mating was further increased in crosses between Abar! and 
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Figure 2 | Autocrine signalling in C. albicans a cells. a, Model of 
conventional paracrine signalling between a and « cells and two models for 
autocrine signalling. Note that ‘autocrine’ signalling is used here to refer to 
signalling within populations of a cells. b, C. albicans a cells after 4 days 
growth on YPD medium. Whereas Abar! strains showed a highly wrinkled 
morphology, reconstituted strains in which the BARI gene had been 
reintegrated did not show wrinkling (not shown). c, Demonstration of 
autocrine signalling among different populations of a cells after 4 days 
growth on YPD medium. Wrinkling at the edge of a test colony is indicative 
of autocrine signalling. 


wild-type a strains, with 0.88% of cells undergoing mating, an 
increase of more than 40,000-fold over that between wild-type a 
strains (Fig. 3c). In fact, Abar] a cells mated as efficiently with other 
a cells as with o cells (0.80%), indicating a loss in sexual preference for 
one mating-type partner over the other. These experiments 


891 


©2009 Macmillan Publishers Limited. All rights reserved 


LETTERS 


a Diploid Tetraploid 


Cell 
counts 
Cell 
counts 


°o 
(=) 


0 Sytox 1,000 ) Sytox 1,900 
fluorescence fluorescence 
DSY739 DSY748 DSY753 
200 200 00 
-2# a2 2 
O38 O85 Os 
13) oO oO 
0 0 ty) 
) Sytox 1,000 0 Sytox 1,000 0 Sytox — 1,000 
b fluorescence fluorescence fluorescence 


123 45 6 7 8 9 1011 1213 14 15 16 17 18 19 20 21 22 23 24 25 


c 
Mating Genotypes crossed Mating frequency (%) 
1 Wild-type a x Wild-type a 94+15 
2 Wild-type a x Wild-type a <0.00002 
3 Abar1 a x Wild-type a 0.88 + 0.66 
4 Abar1 a x Wild-type a 0.80 + 0.29 
5 Abar1 ax Abari a 0.037 + 0.035 
6 Wild-type a x Wild-type a x (Wild-type a) 0.22 + 0.043 
7 Amfa ax Amfa a x (Wild-type a) 0.12 + 0.064 
8 Abar1/Aste2 a x Wild-type a <0.0002 
9 Abar1/Aste3 a x Wild-type a 0.2 + 0.69 
10 Abar1/Amfa a x Wild-type a <0.00002 
11 Wild-type o x Wild-type a x (Wild-type a) 0.043 + 0.049 


Figure 3 | Homothallic mating in C. albicans. a, Flow cytometric analysis 
indicating the tetraploid content of progeny isolated from a—a matings. The 
x-axis of each graph (Sytox) represents a linear scale of fluorescence and the 
y-axis (Cell counts) a linear scale of cell number. For analysis of additional 
same-sex mating products see Supplementary Fig. 6. b, PCR confirming the 
mating type of a~a mating products using primers directed at OBPa and 
OBPza. Odd-numbered lanes are OBPa reactions, whereas even-numbered 
lanes are OBP« reactions. Lanes 1, 2 and 3, 4 are control reactions for a and « 
cells, respectively. c, Table of mating frequencies for different genetic 
crosses. Values are listed as mean frequency = s.d. 


demonstrate that mating can take place even within unisexual popu- 
lations of C. albicans cells, and that Bar1 acts to limit these types of 
self-fusion events. Notably, mating between wild-type a cells also 
occurred at an appreciable frequency (0.22%) if co-cultured with 
cells. In these ménage a trois matings, « cells presumably provided 
levels of «-pheromone that were sufficient to overcome Bar! activity 
and drive a—a mating. Levels of «-pheromone may be augmented by 
autocrine pheromone signalling in a cells, as deletion of MF ina cells 
led to a slight reduction in the frequency of ménage a trois mating 
(0.12%). In addition, « cells did not need to be in contact with a cells 
to drive same-sex mating (Supplementary Fig. 4). Mating between 
wild-type cells (0.043%) was similarly observed when co-cultured 
with a cells (see Fig. 3). 

If self-mating in unisexual populations is a direct consequence of 
autocrine pheromone signalling, then mutants deficient in signalling 
should be unable to self-mate. Both Abar1/Aste2 and Abar1/Amfz a 
strains did not exhibit autocrine signalling and also showed no 
detectable mating with wild-type a strains (<0.0002%; Fig. 3c). In 
contrast, Abar1/Aste3 a strains were competent for both autocrine 
signalling and unisexual mating (1.2%). The potential for unisexual 
mating is therefore dependent on the ability of cells to undergo 
autocrine pheromone signalling. To examine the efficiency of 
inbreeding and outbreeding by different C. albicans strains, SC5314 
Abar1 a strains were also mated with different clinical isolates 
(Fig. 4a). Again, Abar! strains mated with both a and « isolates at 
a similar frequency (average of 9.8% and 17.6%, respectively). In fact, 
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mating was generally more efficient between SC5314 and clinical 
isolates (up to 26% a—a mating) than that between SC5314 strains 
alone. Mating zygotes could be detected in unisexual crosses and 
fluorescent labelling of the nuclei confirmed that karyogamy had 
taken place (Fig. 4b and Supplementary Fig. 5). The tetraploid 
products of unisexual mating were generally stable, but could be 
induced to undergo chromosome loss to produce diploid cells, 
thereby completing a parasexual mating cycle (Supplementary Fig. 7). 
Our findings have implications for sexual reproduction in 
C. albicans and related species, as they reveal a mechanism for same- 
sex mating in hemiascomycetes. In C. albicans, heterothallic mating 
between a and & cells can occur, yet a predominantly clonal popu- 
lation structure suggests that outbreeding is rare'’. The discovery of 
self-mating indicates that homothallism could act to create a burst of 
genetic diversity by increasing recombination within unisexual popu- 
lations. Completion of the parasexual cycle could promote allele 
shuffling or increase karyotypic variation within the diploid genome. 
Furthermore, isolates of some related species are thought to be 
exclusively homothallic. For example, isolates of Lodderomyces 
elongisporus are able to form asci from identical cells, yet the genome 
apparently encodes only one pheromone gene (MF) and one recep- 
tor gene (STE2)”. Discovery of autocrine signalling and self-mating 
reveals a mechanism by which such species could reproduce sexually 
despite the absence of mating partners of the opposite sex. 
Homothallism has been well characterized in S. cerevisiae where it 
is a direct result of mating-type switching. C. albicans and related 
species are unable to undergo mating-type switching, yet we demon- 
strate two potential pathways by which self-mating can occur. First, 
inhibition of Barl activity promotes autocrine signalling and mating 
within unisexual populations of C. albicans. We hypothesize that 
certain niches in the mammalian host will favour self-fertilization 
by downregulating Barl activity. Second, we show that same-sex 
mating can occur because of pheromone signalling in conventional 
a—o mixes. In this case, pheromone secretion by even a limiting 
number of « cells can promote a—a mating (Supplementary 
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Figure 4 | Homothallic mating across clades in C. albicans. a, Mating 
frequencies of an $C5314 Abar1 a cell crossed with naturally occurring 
isolates from different clades of C. albicans. 1-8 are crosses between Abar1 
a cells and clinical « cells. 9-18 are crosses between Abar] a cells and clinical 
a cells. For a complete list of strains see Methods. b, Mating zygote formed 
between a Abar] a cell derived from $C5314 (DSY906, expressing HTB—RFP; 
histone H2B gene-red fluorescent protein) and a clinical a cell derived from 
THEM16614 (DSY908, expressing HTB—YFP). Arrows indicate fused nuclei 
expressing both fluorescent proteins. DIC (left), YFP fluorescence (observed 
using the GFP channel, middle) and RFP fluorescence (right) images are 
shown. Scale bar, 15 um. 
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Table 3). In addition, certain combinations of C. albicans a and 
a strains are incompatible for heterothallic mating as nuclei fail to 
undergo karyogamy”'. Our results show that pheromones produced 
during contact between two such ‘infertile’ strains could still result in 
self-mating within sexes. 

The program of same-sex mating in C. albicans shows unexpected 
similarities to that in other human fungal pathogens. In the basidio- 
mycetous fungus Cryptococcus neoformans mating between a and « 
isolates can occur, yet clinical and environmental isolates are over- 
whelmingly of the « mating type. Same-sex mating between « isolates 
of C. neoformans has been observed both in the laboratory and in the 
wild, and is thought to be the predominant mode of sexual reproduc- 
tion in this organism***°*. Pneumocystis carinii is another fungal 
pathogen predicted to undergo a modified sexual cycle involving cells 
of the same mating type’*. It thus seems that autocrine pheromone 
signalling and homothallism may be traits shared by a diverse group of 
medically relevant fungi. These pathogens have retained the ability to 
undergo sexual reproduction and yet have evolved modifications to 
these programs that support their survival and propagation. 
Elucidating the respective mechanisms of homothallism and hetero- 
thallism will therefore be critical to the understanding of these species 
as human pathogens. 


METHODS SUMMARY 


C. albicans strains were derived from $C5314 except where otherwise noted. 
Mutant strains were created using either the PCR fusion or SAT] flipper methods 
described previously””**. For a complete list of strains and primers see 
Supplementary Tables 1 and 2. Media were made as described previously’. All 
C. albicans cells used in this study were in the opaque phase as this is the mating- 
competent form of the organism®. Quantitative PCR was performed on a cells 
that had been treated with 10 pg ml! «-pheromone for 4 h in Spider medium, as 
described previously'*”. For analysis of colony morphology, patches of 
C. albicans cells (~0.5 cm”) were spaced 0.75cm apart on YPD medium and 
imaged after 4 days growth at 30 °C using a Zeiss Stemi 2000-C stereoscope with 
an Infinity 1 camera. Cells were examined using a Zeiss Axioplan 2 microscope 
equipped with a Hamamatsu-ORCA camera. Mating frequencies were deter- 
mined by mixing 2 X 10’ cells of differently marked auxotrophic strains on 
Spider medium and incubating at room temperature for 3 days. Cells were then 
resuspended in water and plated onto selective media to determine frequency of 
mating products’°. For analysis of a~a mating in the presence of « cells, 2 X 10” 
cells of each of three strains were mixed and incubated on Spider medium (two 
differently marked a strains and an «& strain negative for both markers). Cells 
were recovered and analysed as above. The same strategy (two differently marked 
a strains and an a strain negative for both markers) was applied to detect o—-« 
mating. Flow cytometry to determine cell ploidy was performed by staining DNA 
with Sytox Green and analysis on a FACSCalibur*. PCR to determine mating type 
was performed using primers directed at OBPa/« as described previously”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Strain/plasmid construction. The parental strains for SC5314-derived mutants 
were based on auxotrophic strains described previously'®. Gene deletions were 
constructed using the fusion PCR or SAT! flipper methods described previ- 
ously’””*, All strains are listed in Supplementary Table 1. The bar] mutant strains 
CAY320/321 were generated as described previously'® except an ARG4 marker 
was used in place of the HIS1 marker. To delete STE2, oligonucleotides 1/2 were 
used to amplify the 5’ flank and oligonucleotides 3/4 to amplify the 3’ flank of the 
gene from SC5314. Oligonucleotides 1/4 were then used to generate a disruption 
cassette containing either the LEU2 or HIS1 selectable marker’’. The cassette was 
transformed into RBY1118 and integration checked using oligonucleotides 5 or 6 
together with primers internal to the marker, and loss of the ORF was confirmed 
by PCR using oligonucleotides 7/8, resulting in strains RBY1107 and RBY1108. A 
similar method was used to delete STE3 and MFz genes. For STE3, oligonucleo- 
tides 9/10 and 11/12 were used to amplify the 5’ and 3’ flanks, respectively, 
whereas for MFz, oligonucleotides 17/18 and 19/20 were used to amplify the 
flanking regions. Disruption cassettes were then generated using oligonucleo- 
tides 9/12 and 17/20 to amplify the selective marker together with flanking gene 
sequences. Correct integration of the cassette at STE3 was confirmed using 
flanking primers 13/14 and loss of the ORF using 15/16, resulting in strains 
RBY1113 and RBY1114. For MFz, integration was checked using oligonucleo- 
tides 21/22 and deletion of the ORF was confirmed using oligonucleotides 23/24 
resulting in strains RBY1081 and RBY1082. To generate the plasmid pDS1 for 
deletion of BARI, oligonucleotides 25/26 were used to generate a 5’ flank that 
was digested with Apal and Xhol and ligated into pSFS2a. Subsequently, oligo- 
nucleotides 27/28 were used to generate a 3’ flank that was digested with SacI and 
SaclI and ligated into the vector to generate plasmid pDS1. Apal/SacI-digested 
pDS1 was then used to transform strains RBY1107, RBY1113 and RBY1081, to 
generate bar1 mutant strains DSY760/761, DSY792/793 and DSY762/768, 
respectively. Integration was checked using oligonucleotides 29/30 paired with 
oligonucleotides internal to pSFS2a, and loss of the ORF was confirmed using 
oligonucleotides 31/32. To generate plasmids for integration of GFP under the 
control of a test promoter, GFP was amplified from plasmid pNIM1*! using 
oligonucleotides 33/34, cut with Apal and SalI and ligated into an Apal/Xhol- 
digested pSFS2a, generating pRS2. The promoters of FUS1 and FIG1 were then 
amplified using oligonucleotides 35/36 and 37/38, respectively. The PCR 
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products were digested with KpnI/Apal and ligated into pRS2 to generate 
pDS2 and pDS3 respectively. The plasmid pDS2 was linearized with AflII and 
transformed into RBY1117 and RBY1197 and integration checked using oligo- 
nucleotides 39/40 to generate DSY702 and DSY690, respectively. The plasmid 
pDS3 was linearized with AflII and transformed into RBY1117 and RBY1220, 
and integration checked using oligonucleotides 40/41 to generate DSY700 and 
DSY696, respectively. To generate the mating tester strains DSY170/171, 
RBY1118 was transformed with a fragment containing SAT! from the plasmid 
pNIM1. This fragment targets an intergenic region of chromosome 5 near MTL 
and was created using oligonucleotides 42/43. To generate the strain CAY371, a 
WORI-YFP fusion construct (white-opaque regulator 1-yellow fluorescent 
protein) was transformed into CAY320 (ref. 32). To generate fluorescently 
labelled nuclei, strains RBY1197 and DSY168 were transformed with HTB- 
RFP and HTB-YFP constructs as described previously~’** to generate strains 
DSY906 and DSY908, respectively. Select clinical isolates were also obtained 
from D. MacCallum of the Aberdeen Fungal group and grown on sorbose media 
to generate homozygous a and & derivatives, as described previously”. 
Clinical mating crosses. All clinical isolates were crossed with the SC5314 Abar1 
a strain, CAY371. Clinical « isolates correspond as follows: 1, AM2003/0165; 2, 
L1086 derivative (DSY159); 3, YSU751 derivative (DSY161); 4, Hun92 derivative 
(DSY165); 5, IHEM16614 derivative (DSY183); 6, 19F; 7, 78048; 8, J981315 
derivative (DSY167); CAY371 with WO-1 is not shown . Clinical a isolates 
correspond as follows: 9, L1086 derivative (DSY164); 10, P37005; 11, 12C; 12, 
L26; 13, AM2003/0191 derivative (DSY162); 14, IHEM16614 derivative 
(DSY168); 15, T101; 16, J981315 derivative (DSY194); 17, RIH09; 18, 
AM2005/0377. 
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Programming cells by multiplex genome engineering 


and accelerated evolution 


Harris H. Wang’”*, Farren J. Isaacs'*, Peter A. Carr*’, Zachary Z. Sun®, George Xu®, Craig R. Forest’ 


& George M. Church! 


The breadth of genomic diversity found among organisms in nature 
allows populations to adapt to diverse environments'”. However, 
genomic diversity is difficult to generate in the laboratory and new 
phenotypes do not easily arise on practical timescales*. Although in 
vitro and directed evolution methods*” have created genetic 
variants with usefully altered phenotypes, these methods are limited 
to laborious and serial manipulation of single genes and are not used 
for parallel and continuous directed evolution of gene networks or 
genomes. Here, we describe multiplex automated genome engineer- 
ing (MAGE) for large-scale programming and evolution of cells. 
MAGE simultaneously targets many locations on the chromosome 
for modification in a single cell or across a population of cells, thus 
producing combinatorial genomic diversity. Because the process is 
cyclical and scalable, we constructed prototype devices that auto- 
mate the MAGE technology to facilitate rapid and continuous 
generation of a diverse set of genetic changes (mismatches, inser- 
tions, deletions). We applied MAGE to optimize the 1-deoxy-p- 
xylulose-5-phosphate (DXP) biosynthesis pathway in Escherichia 
coli to overproduce the industrially important isoprenoid lycopene. 
Twenty-four genetic components in the DXP pathway were modi- 
fied simultaneously using a complex pool of synthetic DNA, 
creating over 4.3 billion combinatorial genomic variants per day. 
We isolated variants with more than fivefold increase in lycopene 
production within 3 days, a significant improvement over existing 
metabolic engineering techniques. Our multiplex approach 
embraces engineering in the context of evolution by expediting 
the design and evolution of organisms with new and improved 
properties. 

With the advent of next-generation fluorescent DNA sequencing”®, 
our ability to sequence genomes has greatly outpaced our ability to 
modify genomes. Existing cloning-based technologies are confined to 
serial and inefficient introduction of single DNA constructs into cells, 
requiring laborious and outdated genetic engineering techniques. 
Whereas in vivo methods such as recombination-based genetic 
engineering (recombineering) have enabled efficient modification of 
single genetic targets using single-stranded DNA (ssDNA)!""*, no 
such attempts have been made to modify genomes on a large and 
parallel scale. MAGE provides a highly efficient, inexpensive and auto- 
mated solution to simultaneously modify many genomic locations 
(for example, genes, regulatory regions) across different length scales, 
from the nucleotide to the genome level (Fig. 1). 

Efficiency of the MAGE process was characterized using a modi- 
fied E. coli strain (ECNR2). Mediated by the bacteriophage A-Red 
ssDNA-binding protein 8, allelic replacement is achieved in ECNR2 
by directing ssDNA or oligonucleotides (oligos) to the lagging strand 


of the replication fork during DNA replication’. We optimized a 
number of parameters (see Supplementary Information, Supplemen- 
tary Fig. 2 and Supplementary Table 1) to maximize efficiency of 
oligo-mediated allelic replacement. To generate sequence diversity 
in any region of the chromosome by allelic replacement, a pool of 
targeting oligos is repeatedly introduced into a cell. Under optimized 
conditions, we can successfully introduce new genetic modifications 
in >30% of the cell population (Supplementary Fig. 2d) every 
2-2.5h. 

Oligo-mediated allelic replacement is capable of introducing a 
variety of genetic modifications at high efficiency. The efficiency of 
generating a mismatch or insertion modification is correlated to the 
amount of homologous sequence between the oligo and its chromo- 
somal target (Fig. 2a, b); the efficiency of producing a deletion modi- 
fication is correlated to the size of the deletion (Fig. 2c). Figure 2d 
shows that the predicted two-state hybridization free energy AG (ref. 
15) between the oligo and target chromosomal sequence is a pre- 
dictor of the allelic replacement efficiency. Thus, in a pool of oligos 
with degenerate sequences, oligos with more homology to the target 
will be incorporated in the chromosome at a higher frequency than 
those with less homology. This feature of MAGE enables tunable 
generation of divergent sequences along favourable evolutionary 
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Figure 1| Multiplex automated genome engineering enables the rapid and 
continuous generation of sequence diversity at many targeted 
chromosomal locations across a large population of cells through the 
repeated introduction of synthetic DNA. Each cell contains a different set of 
mutations, producing a heterogeneous population of rich diversity (denoted 
by distinct chromosomes in different cells). Degenerate oligo pools that 
target specific genomic positions enable the generation of a diverse set of 
sequences at each chromosomal location. 
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Figure 2 | Characterization of allelic replacement efficiency as a function of 
the type and scale of genetic modifications. a, Introducing mismatch 
mutations of up to 30 bp. b, Inserting exogenous sequences of up to 30 bp. 
c, Removing up to 45 kbp of chromosomal sequence using a single oligo. 
d, Correlation of replacement efficiency and two-state hybridization energy 
AG between the oligo and the targeted complement region in the genome. 
See Supplementary Fig. 1 for an illustration of oligo interaction with 
genomic targets and Supplementary Table 3 for a list of oligos and 
corresponding AG values. Dashed line is the linear regression correlation 
iy 0.288x — 13.7, R” = 0.799). All oligos used were 90 bp with two 
phosphorothioate bonds at the 3’ and 5’ ends. All error bars indicate + s.d.; 
n= 3. 


To determine the rate at which MAGE generates sequence diversity, 
we used three different 90-mer oligos to produce mismatch changes in 
a targeted region of the lacZ gene in three distinct cell populations. The 
cNéand cN30 oligos contained 6 and 30 consecutive degenerate bases, 
respectively; the iN6 oligos contained 6 degenerate bases interspersed 
across a 30-bp region (Fig. 3). For these cell populations, the targeted 
lacZ region was sequenced in 96 random clonal isolates after MAGE 
cycles 2, 5, 10 and 15 that provided a snapshot of the genotypic vari- 
ation in each population. Through successive cycles of MAGE, the 
chromosomal sequence of the lacZ region increasingly diverged away 
from wild type (Fig. 3). For cN6, after five MAGE cycles, we detected 
an average change of 3.1 bp per cell across the population (Fig. 3 inset, 
blue line), which equates to the generation of more than 4.3 X 10’ bp 


lacZ gene atgACCATGATTACGGATTCACTGGCCGTCGTTT TACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCC 
ATGATTACGGATTCACTGGCCGTCGTTTTACANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCAACTTAATCGCCTTGCAGCACATCCC 
ATGATTACGGATTCACTGGCCGTCGTTTTACANCGTCGNGAC TGNGAAAANCCTGGNGTTACNCAACTTAATCGCCTTGCAGCACATCCC 
ATGATTACGGATTCACTGGCCGTCGT TT TACAACGTCGTGACNNNNNNAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCC 
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Figure 3 | Sequence diversity generated across three separate cell 
populations as a function of the number of MAGE cycles. Three 90-mer 
oligo pools were investigated: cN30, iN6 and cN6. cN30 contains oligos with 
30 bp of consecutive degeneracy; iN6 contains oligos with 6 bp of degeneracy 
spaced every 5 bp; cN6 contains oligos with 6 bp of consecutive degeneracy. 
Frequency of strains in each population that contains 0 to 7+ bp of 
differences from the wild-type lacZ sequence are colour-coded. The inset 
shows average number of base pairs changed from wild type across the whole 
cell population as a function of the number of MAGE cycles using the three 
oligo pools cN30 (orange line), cN6 (blue line) and iN6 (red line). 
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of variation per day (3.1 bp changes per five cycles in7 X 10° cells at 10 
cycles per day). Within 15 cycles, cell populations containing all 
possible N6 genotype combinations were generated using either cN6 
or iN6 oligos. Because the replacement efficiency for a 30-bp mis- 
matching oligo is lower (1.5% from Fig. 2a), only 21.8% of the 
cN30 cell population had undergone allelic replacement after 15 
cycles. We detected an average change of 5.6bp per cell from the 
wild-type sequence across the whole cN30 population (Fig. 3 inset, 
orange line). 

The depth at which MAGE generates diversity is determined by a 
combination of three factors: (1) the degree of sequence variation 
desired at each locus; (2) the number of loci targeted; and (3) the 
number of MAGE cycles performed. When a single locus is targeted 
using a degenerate oligo pool, genetic diversity is generated across the 
population at that locus and is a function of the oligo pool complexity 
only. If more than one locus is targeted simultaneously, diversity is 
generated through the combinatorial arrangement of the different 
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Figure 4 | MAGE automation. a, Detailed schematic diagram of MAGE 
prototype including climate-regulated growth chambers with real-time cell 
density monitors (green), anti-fouling fluidics for transfer of cells between 
growth chambers and exchange of media and buffers (blue), and real-time 
generation of competent cells for transformation with synthetic DNA 
(yellow). Cultures are carried through different chambers at different 
temperature regimes (30 °C, 42 °C, 4 °C) depending on the necessary MAGE 
steps (that is, cell growth, heat-shock, cooling). Cells are made 
electrocompetent by concentration onto a filter membrane and 
resuspension with wash buffer. Oligos are delivered into cells by 
electroporation. b, Step-by-step diagram of MAGE cycling steps at a total 
run time of 2—2.5h per cycle. Owing to high voltage (18 kV cm ') 
electroporation, ~95% of cells are killed at each cycle. Hence, the 
electroporation event serves to both introduce oligos into cells and to dilute 
the cell population, cells are then recovered and grown to mid-log phase 
(7 X 10* cells ml’) in liquid medium for the subsequent cycle. 
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modified loci. The frequency at which each locus is modified can be 
computationally predicted through a binomial distribution 
(Supplementary Fig. 3). Although the cell population at any cycle 
may reflect only a subset of all variants theoretically possible, we can 
cumulatively generate more variants than the actual size of the cell 
population (~7 X 10° cells) through successive MAGE cycles. Thus, 
we can generate all variants regardless of population size through 
computational predictions and continuous cycling by MAGE. 

Given the cyclical and scalable nature of our approach, we con- 
structed an integrated prototype device that automates the MAGE 
process to enable fast and reliable cellular programming. The device 
contains growth chambers to maintain healthy cell cultures and elec- 
troporation modules to repeatedly deliver DNA into the cells, thereby 
facilitating genome engineering and evolution (Fig. 4, Methods and 
Supplementary Information). Complex culturing conditions can be 
programmed into the device for growth of a diverse set of organisms 
and ecosystems. 

To demonstrate an application of the MAGE process, we optimized 
metabolic flux through the DXP biosynthesis pathway to overproduce 
the isoprenoid, lycopene, in an E. coli strain (ECHW2) that contained 
the pAC-LYC plasmid that is necessary for the final steps of lycopene 
production. Twenty endogenous genes (dxs, dxr, ispD, ispE, ispG, 
ispH, idi, ispA, appY, rpoS, crl, elbA, elbB, yjiD, purH, rnlA, ygeT, 
ycgZ, ymgA, ariR) documented to increase lycopene yield'®'” were 
targeted to tune translation (Fig. 5a). Specifically, for each of the 20 
genes, 90-mer oligos containing degenerate ribosome binding site 
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Figure 5 | Optimization of the DXP biosynthesis pathway for lycopene 
production. a, Genomic positions of 24 targeted genes with the RBS 
optimization strategy on the left (red) and gene knockout strategy on the 
right (blue). The gene knockout strategy involves the introduction of two 
nonsense mutations. All 90-mer oligos contain two phosphorothioated 
bases at the 3’ and 5’ termini. b, Black bars represent the growth rate of 
isolated variants (ECHW2a-f) relative to the ancestral ECHW1 strain. White 
bars represent lycopene production in p.p.m., which is normalized by dry 
cell weight in ancestral and mutant strains. Colour-coded labels in each 
white bar represent genetic modifications found by sequencing. All error 
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(RBS) sequences (DDRRRRRDDDD; D = A, G, T; R= A, G) flanked 
by homologous regions on each side were used, with a total pool 
complexity of 4.7 X 10° (3° X 2° X 20). The replaced RBS regions 
were designed to be more similar to the canonical Shine-Dalgarno 
sequence (TAAGGAGGT)", giving rise to enhanced translation effi- 
ciency. Additionally, four genes (ytjC, fdhF, aceE, gdhA) from second- 
ary pathways" were targeted for inactivation by oligos that introduced 
two nonsense mutations in the open reading frame, further improving 
flux through the DXP pathway. In contrast to prior strategies*”** that 
were experimentally limited by the number of genetic components 
that could be manipulated at once, here we optimized 24 genes simul- 
taneously to maximize lycopene production. 

As many as 15 billion genetic variants (4.3 X 10° bp variations per 
cycle for 35 MAGE cycles) were generated. Screening of variants was 
done by isolating colonies that produced intense red pigmentation on 
Luria—Bertani agar plates. Variants were isolated from ~10° colonies 
screened after 5-35 cycles of MAGE (see Supplementary Information), 
some exhibiting as much as a fivefold increase in lycopene production 
relative to the ECHW2 ancestral strain (Fig. 5b). Under similar experi- 
mental conditions, our highest lycopene yield of ~9,000 p.p.m. (1g per 
g dry cell weight) is better than documented yields'””’. Sequencing of 
six variants (ECHW2a-EcHW2f) revealed RBS convergence towards 
consensus-like Shine-Dalgarno sequences in genes localized at the 
beginning and end of the biosynthesis pathway (dxs, dxr, idi, ispA) as 
well as various gene knockouts from secondary pathways (AytjC, 
AgdhA, AfdhF) (Fig. 5c and Supplementary Table 2). 


Gene knockouts 


. . GAGTGACGTATTTAGGGC > 
atgTTA. . GAGCGACGTATT CAGGGC 


24 


at gTTA. . GAGTGACGTATTTAGGGC 
stop stop 


© 
oN aly, 


G3P_ Pyruvate —> —> _+ 


CDP-MEP 
1 ispF 
MEC 
1 isopG 
HMBPP 


Jia 


IPP <—> DMAPP 


FPP 


ispH 


Q 
= 
n 
ie} 
& 
S 
S 
3 
a 
3 


dxs AfdhF 


| crtEBI 


EcHW2 EcHW2a EcHW2b EcHW2c EcHW2d EcHW2e EcHW2f 
———————————— ESS 
MAGE-cycled strains 


Lycopene 


bars indicate + s.d.; n = 3. c, Modifications to the lycopene biosynthesis 
pathway of isolated variants ECHW2a-f with relevant genes highlighted by 
rectangular boxes. Blue labels represent knockout targets, red labels 
represent RBS tuning targets. AcCoA, acetyl-CoA; CDP-ME, 
4-diphosphocytidyl-2-C-methyl-p-erythritol; CDP-MEP, 
4-diphosphocytidyl-2C-methyl-p-erythritol-2-phosphate; DMAPP, 
dimethylallyl diphosphate; FPP, farnesyl diphosphate; G3P, glyceraldehyde 
3-phosphate; HMBPP, (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate; 
IPP, isopentenyl diphosphate; MEP, 2-C-methyl-p-erythritol-4-phosphate; 
MEC, 2C-methyl-p-erythritol-2,4-cyclodiphosphate. 
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Different tuning parameters of the lycopene pathway can be indi- 
vidually and combinatorially assessed in the isolated variants. For 
example, translation optimization of idi alone (ECHW2a) increased 
lycopene production by 40%. Whereas optimizing dxs and idi 
increased lycopene production by 390% (EcHW2e), additional 
optimization at rpoS and dxr, along with inactivation of ytjC, 
improved the growth rate of ECHW2f to that of the ECHW1 ancestor. 
Interestingly, rpoS is the alternative RNA polymerase subunit sigma 
factor o°, the master stress response regulator, and its upregulation 
can increase stress resistance to the accumulation of lycopene, a very 
hydrophobic molecule’’. ytjC is an uncharacterized phosphoglyce- 
rate mutase enzyme and is thought to increase metabolic flux 
through the DXP pathway by increasing the accumulation of glycer- 
aldehyde 3-phosphate intermediates’’. Inactivation of glutamate 
dehydrogenase (gdhA) increases lycopene production, but causes a 
32% decrease in growth rate of EcCHW2b relative to wild type. 
Combinations of genetic modifications can also be assessed against 
each other. For example, an optimized dxs and idi strain (ECHW2e) 
produces 12% more lycopene than a strain with dxs and AfdhF 
(EcHW2d) and 27% more than a strain with dxs and ispA modifica- 
tions (ECHW2c). The optimized DXP biosynthesis pathway pre- 
sented here can be used to produce many other isoprenoid 
compounds of industrial and pharmaceutical relevance”. 

The diversity (that is, the degree of sequence change per target) 
generated by MAGE is adjustable and the specificity of targeting is 
always high. Oligos with defined sequences produce well-defined 
modification, whereas oligos with degenerate sequences produce 
high-diversity modifications tailored for exploring a vast sequence 
space. In this study, we used well-defined oligos to inactivate protein- 
coding sequences and high-diversity degenerate oligos to modify and 
sample different RBS sequences. We have also used the MAGE plat- 
form to perform whole-genome recoding of E. coli to enhance the 
incorporation of non-natural amino acids into proteins” and con- 
struct safer and multi-virus-resistant strains (F.J.I. et al., manuscript 
in preparation). MAGE is thus a complementary technology to de 
novo genome synthesis”, allowing the tuning of synthetic and natural 
genomes in vivo for various applications. 

MAGE is also an accelerated evolution platform that permits the 
repeated introduction and maintenance of many neutral (or deleteri- 
ous) mutations in the cell population. Although these mutations 
would normally disappear in the population via genetic drift or natural 
selection, MAGE accelerates the rate of their accumulation in any 
individual cell, thus increasing the likelihood of finding sets of muta- 
tions that may interact synergistically to produce a surprisingly bene- 
ficial phenotype. Using this technology, we could engineer or evolve 
cells with higher transfection efficiency (for example, harnessing 
natural competence systems” ), increased allelic replacement efficiency 
(for example, expressing higher levels or mutants” of the A-Red B 
protein) and perform large-scale bacterial artificial chromosome 
engineering. The simple allelic replacement mechanism could make 
this method amenable for use in other organisms, given that other 
ssDNA-binding protein homologues are functional”. Currently, 
30US dollars of commercially synthesized oligos can introduce up 
to 27 bp of modification at full degeneracy for a single genomic target. 
To target many loci, obtaining oligos from programmable DNA 
microchips” can significantly decrease the cost in comparison to 
traditional oligo synthesis. We envision that large-scale pipelines to 
program synthetic organisms and ecosystems'® will greatly benefit 
from integration of hardware, software and wetware to engineer and 
evolve microbial, plant and animal systems. 


METHODS SUMMARY 

Strains. The EcNR2 strain is MG1655 with -prophage::bioA/bioB and 
cmR:mutS. The EcHW2 strain is MG1655 with A-prophage::bioA/bioB, 
kanR::mutS and pAC-LYC plasmid. EcHW2a-f strains were MAGE-cycled 
EcHW2? derivatives containing DXP pathway modifications. Details of strains 
used are described in Methods. 


LETTERS 


Allelic replacement. Liquid cell cultures were inoculated from colonies obtained 
from a plate and grown at 30°C to an absorbance (600nm) of ~0.7 
(~7 X 10° cellsml~') in a rotor drum at 200r.p.m. To induce expression of 
the A-Red recombination proteins (Exo, Beta and Gam), cell cultures were 
shifted to 42°C for 15min and then immediately chilled on ice. In a 4°C 
environment, 1 ml of cells was centrifuged at 16,000g for 30s. Supernatant 
medium was removed and cells were resuspended in dH2O (Gibco catalogue 
number 15230). This process was repeated twice with water. Supernatant water 
was removed and oligos suspended in water were added to the cell pellet. The 
oligos/cells mixture was transferred to a pre-chilled 1 mm gap electroporation 
cuvette (Bio-Rad) and electroporated with a Bio-Rad GenePulser electropora- 
tion system under the following parameters: 1.8kV, 200Q and 25 wF. LB-min 
medium (1-3 ml) was immediately added to the electroporated cells. The cells 
were recovered from electroporation and grown at 30 °C for 2-2.5h. Once cells 
reached mid-logarithmic growth they were used in additional MAGE cycles, 
isolated or assayed for genotype and/or phenotype analysis. 

Lycopene assay. ECHW2 and derivatives (ECHW2a-f) were quantitatively 
assessed for lycopene production after 24 h of growth in LB-min-chlorampheni- 
col liquid medium at 30 °C. Lycopene was extracted and quantified at an absor- 
bance of 470 nm (see Methods). Lycopene yield was calculated by normalizing 
the amount of lycopene extracted to the dry cell weight. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Strains and culture conditions. The i prophage was obtained from strain 
DY330"!, modified to include the bla gene and introduced into wild-type 
MG1655 E. coli by P1 transduction at the bioA/bioB gene locus and selected on 
ampicillin to yield the strain ECNR1 (A-Red*). Replacement of mutS with the 
chloramphenicol resistance gene (cmR cassette) in ECNR1 produced EcNR2 
(mutS”, A-Red*). ECNR2 was grown in low salt LB-min medium (10 ¢ tryptone, 
5 g yeast extract, 5 g NaCl in 11 dH,O) for optimal electroporation efficiency. A 
premature stop codon was introduced into the cmR gene of EcNR2 with oligo 
cat_fwd_stop (Supplementary Table 3) to produce EcFI5, thus inactivating the 
cmR gene. An oligo (cat_fwd_restore) containing the wild-type sequence was used 
to restore the CmR phenotype. The pAC-LYC plasmid* containing genes crtE, 
crtB and crtI was electroporated into EcNR1 to generate ECHW1, which produces 
lycopene at basal levels. Replacement of mutS with a kanamycin resistance gene in 
EcHW1 produced EcCHW2. 

Oligonucleotides and DNA sequencing. All oligonucleotides were obtained from 
Integrated DNA Technologies with standard purification. Oligonucleotides used in 
the MAGE process contained the following modifications: (1) 30-110 bp in length 
for optimization experiments; (2) different numbers of phosphorothioated bases; 
and (3) degenerate nucleotides as described elsewhere in this paper. Additional 
primers were purchased to amplify relevant genetic regions of the lycopene pathway 
to sequence strains that expressed high levels of lycopene. DNA sequencing to 
confirm allelic replacements was performed by Agencourt Bioscience. 

LacZ and chloramphenicol replacement efficiency assays. Replacement effi- 
ciency was characterized by performing the allelic replacement protocol on 
EcNR2 cells using 90-mer oligos (Supplementary Table 3) that produced a 
premature stop codon in the chromosomal lacZ gene. In general, 250-500 cells 
were plated on LB-min containing 5-bromo-4-chloro-3-indoyl-B-p-galactoside 
and isopropyl-B-b-thiogalactoside (USB Biochemicals) agar plates. Efficiency of 
allelic replacement was calculated by taking the ratio of the number of white 
colonies to the total number of colonies on plates. A similar strategy was used in 
the cmR gene recovery experiments with strain EcFI5 where 30-110mer oligos 
were used to determine optimal oligo length for allelic replacement 
(Supplementary Fig. 2a). These oligos contained two phosphorothioate bonds 
at both the 5’ and 3’ termini. Cells were plated on LB-min-chloramphenicol and 
LB-min agar plates and grown overnight. Efficiency of allelic replacement 
was calculated by taking the ratio of the number of colonies on LB-min- 
chloramphenicol plates to the number of colonies on LB-min plates. 

MAGE automation device. Cells were grown in sterilized 10-ml glass vials placed 
in thermally conductive blocks. The growth chambers were climate-regulated 
through temperature controllers that actuate Peltier heating/cooling elements. 
Cultures acclimatized quickly (<15s) in chambers held at different tempera- 
tures due to small volumes (for example, 3 ml). Aeration of the culture was 
achieved through agitation of the chambers at 300 r.p.m. using an orbital shaker. 
Real-time monitoring of growth rates was achieved by detecting changes in light 
transmittance across the chamber from light-emitting diodes emitting at 
600 nm. Cultures were transferred between chambers through solenoid isolation 
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valves using syringe pumps. Cells were made electrocompetent by a filtration 
system, which uses a syringe pump to concentrate cells onto a filter membrane 
(0.22 um pore size) and resuspend them off the membrane with appropriate 
electroporation buffer (for example, dH,O). Single-stranded oligos (or PCR 
products) were electroporated into cells in a conductive cuvette with an electric 
pulse (18kVcm7'). After each cycle, the entire system (chambers, syringes, 
filters) was washed with 70% ethanol followed by dH,O three times to reduce 
contamination and biofilm formation. All instruments were digitally controlled 
through software written in the LabView programming environment (National 
Instruments). 

Cultures were initially inoculated into 3 ml of medium in the growth chamber. 

The device then executed repeatedly and continuously through the following 
steps: (1) grow cells at 30°C to a pre-set density (that is, OD¢00 nm of 0.7); (2) 
induce cells for allelic replacement via 42 °C heat shock for 15 min; (3) chill cells 
at 4°C to halt cellular metabolism; (4) wash cells through 15-20 iterations of 
filtration and resuspension with dH,O; (5) mix cell suspension and synthetic 
DNA; (6) deliver DNA into cells by electroporation; and (7) resuspend electro- 
porated cells with growth media. 
Colorimetric screen and assay for lycopene production. Cells from the cycled 
EcHW2 population were plated on LB-min-chloramphenicol agar plates and 
grown for | day at 30 °C and 2 days at room temperature to produce red colonies. 
The 24 gene targets were divided into three oligo pools, one containing 10 
targets, a second containing 14 targets and a third containing all 24 targets, which 
were cycled through the MAGE process in three separate cultures. In total, 10° 
colonies with increased colour intensity by visual inspection were screened after 
cycles 5, 10, 15, 20, 25, 30 and 35. From 127 isolates, six strains (ECHW2a-f, with 
representation from each pool) were selected for direct DNA sequencing across 
all gene targets and quantitatively measured for lycopene production. For lyco- 
pene quantification, these isolated colonies were grown in LB-min medium for 
24h. Lycopene was extracted from 1 ml of cells as follows: centrifuged at 16,000g 
for 30s, removal of supernatant media and resuspended with water, and re- 
centrifuged at 16,000g for 30 s. Once the supernatant was removed, the cells were 
resuspended in 200 pl acetone and incubated in the dark for 15 min at 55 °C with 
intermittent vortexing. The mixture was centrifuged at 16,000g for 30s and the 
supernatant lycopene solution was transferred to a clean tube for quantification. 
Absorbance at 470 nm of the extracted lycopene solution was measured using a 
spectrophotometer and calibrated against a known lycopene standard (Sigma- 
Aldrich, catalogue number L9879) to determine the lycopene content. Dry cell 
weight was determined by baking a washed cell pellet at 105 °C for 24h in the 
dark. Lycopene yield of different ECHW2 derivatives (ECHW2a-f) was calculated 
by normalizing the amount of lycopene extracted to the dry cell weight. 
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Intrinsic light response of retinal horizontal cells of 


teleosts 


Ning Cheng’’+, Takashi Tsunenari’*+ & King-Wai Yau” 


The discovery of intrinsically photosensitive retinal ganglion cells 
has overthrown the long-held belief that rods and cones are the 
exclusive retinal photoreceptors’’. Intrinsically photosensitive 
retinal ganglion cells use melanopsin’ as the photopigment, and 
mediate non-image-forming visual functions such as circadian 
photoentrainment. In fish, in situ hybridization studies indicated 
that melanopsin is present in retinal horizontal cells**—lateral 
association neurons critical for creating the centre-surround 
receptive fields of visual neurons. This raises the question of 
whether fish horizontal cells are intrinsically photosensitive. 
This notion was examined previously in flat-mount roach retina, 
but all horizontal-cell light response disappeared after synaptic 
transmission was blocked*, making any conclusion difficult to 
reach. To examine this question directly, we have now recorded 
from single, acutely dissociated horizontal cells from catfish and 
goldfish. We found that light induced a response in catfish cone 
horizontal cells, but not rod horizontal cells, consisting of a modu- 
lation of the nifedipine-sensitive, voltage-gated calcium current. 
The light response was extremely slow, lasting for many minutes. 
Similar light responses were observed in a high percentage of 
goldfish horizontal cells. We have cloned two melanopsin genes 
and one vertebrate ancient (VA) opsin gene from catfish. In situ 
hybridization indicated that melanopsin, but less likely VA opsin, 
was expressed in the horizontal-cell layer of catfish retina. This 
intrinsic light response may serve to modulate, over a long time- 
scale, lateral inhibition mediated by these cells. Thus, at least in 
some vertebrates, there are retinal non-rod/non-cone photorecep- 
tors involved primarily in image-forming vision. 

We performed whole-cell recordings from single, acutely disso- 
ciated horizontal cells from catfish retina. This retina has rod-driven 
horizontal cells and a single type of cone-driven horizontal cell”*. 
Rod horizontal cells are larger, with >50-p1m-long dendrites and a 
smooth somatic membrane (Fig. la, right)”*. Cone horizontal cells 
are more compact, with short processes and a slender axon-like 
process often still attached after dissociation, and spine-like protru- 
sions from the soma”* (Fig. 1a, left). We examined cone horizontal 
cells first because they have been studied more extensively* '*. When 
held steadily at —20 to —30mV (approximate in situ membrane 
potential in darkness) or more hyperpolarized/depolarized poten- 
tials, there was no obvious light-induced current. Catfish cone 
horizontal cells have four reported voltage-gated currents”'*: nifedi- 
pine-sensitive calcium current, tetrodotoxin-sensitive sodium cur- 
rent, delayed-rectifier potassium current, and inward-rectifier 
potassium current (see Supplementary Fig. 1). Accordingly, to screen 
for a light response more closely, we isolated each current pharma- 
cologically (see Methods) and tested the effect of light. Indeed, we 
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observed a modulation of the cone horizontal cell’s Ca”* current by 
light. Rod horizontal cells did not show this effect (see below). 

In physiological concentrations of extracellular ions, the current— 
voltage (I-V) relationship of the isolated Ca** current of a cone 
horizontal cell showed modulation by light in the range of 
+ 30 mV as assayed by voltage ramps. In darkness, the Ca”* current 
was 160 + 62 pA (mean + s.d.) at peak (~0 mV) from 13 cells. For 
seven cells, a bright flash (1 X 10° to 3 X 10° photons jum ~* at 
480 nm, 40-j1m-diameter spot centred on soma; see Methods) trans- 
iently reduced this current, then enhanced it for minutes (‘biphasic’ 
response, Fig. 1b; by a maximum of 10 + 4 and 28 + 12 pA, respect- 
ively, or 9+ 8% and 20 + 9% of dark value, right panel). For five 
cells, the same flash produced only a slow decrease in the current 
(‘monophasic’ response, Fig. 1c; by 25 + 13 pA, or 14 + 4% of dark 
value, right panel). One cell was not light sensitive. For both response 
types, the amplitude of the Ca** current was light sensitive, but not 
the voltage dependence (Fig. 1b and c, left). The biphasic response 
was likely the composite of a slow decrease and a slow increase of the 
Ca** current, with the decrease occurring sooner. In the monophasic 
response, the current increase could still be present (see later), but 
was cancelled by a more delayed and superposing current decrease. 
The light-mediated modulation of the Ca** current can explain why 
no light response was detected at steady —30 mV or more negative 
membrane potentials, in that the current was not activated. At steady, 
more positive potentials, on the other hand, the Ca** current was 
probably inactivated’. We attempted to measure a light-induced 
change in intracellular Ca”* concentration with Fura-2 under cur- 
rent clamp, but, owing to spontaneous, wide swings in the membrane 
potential between positive and negative values (also observed by 
others”’*"* in dissociated horizontal cells) in darkness triggering 
spontaneous Ca”* concentration changes, such measurements were 
not feasible. 

The dark Ca** current and its light modulation tended to run 
down with time, possibly due to washout by the whole-cell-pipette 
solution. A second, identical flash almost invariably elicited a smaller 
response (for example, see Supplementary Fig. 5a), and a third flash 
gave an even smaller, or no, response. This run down and the res- 
ponse slowness precluded quantitative measurements such as the 
intensity—response relationship. If flashes were delivered in rapid 
succession, before the cell’s recovery from each flash, then multiple 
responses could be observed (Supplementary Fig. 2). With a step of 
light, the response was more prolonged as expected (not shown). 
Light did not affect the voltage-gated Na* current, the delayed- 
rectifier K* current, the inward-rectifier K' current (Fig. 1d), or 
the ionotropic glutamate- and GABA (y-aminobutyric acid)- 
induced currents*'® (Fig. le; also Supplementary Fig. 3). Thus, light 
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Figure 1| Light modulated the voltage-gated Ca”* current on catfish cone 
horizontal cells. a, Dissociated cone horizontal cell (left; arrow indicates 
axon-like process) and rod horizontal cell (right). b, Biphasic light response 
of some cone horizontal cells in Ca-Ringer and K-methanesulphonate 
pipette solution. Left: Ca** current J-V relationship before (black), shortly 
after (green), intermediately after (blue) and a long period after bright flash 
(red). Middle: time course of peak Ca‘ -current modulation by light. 
Vertical bars at top indicate flashes. Coloured arrows correspond to 
measurements on left panel. In this and subsequent figures, input resistance 
of the recorded cell (to monitor recording stability) is indicated at the top by 
the membrane current elicited by repetitive, 200 ms, —3 mV voltage steps. 
Time 0 designates onset of first light stimulus, as in other panels. Light 
stimulus was a spot (40 jim diameter, 480 nm) centred on soma, delivering 
3 x 10° photons um”? s | for 1 (first) and 30s (second). Right: collected 
results (n = 7) showing fractional change in Ca?“ current by light, magenta 
shows average. Flash (arrowhead) was 1 X 10°-3 X 10° photons um 7s! 
for 1s. c, Monophasic light response (decrease in Ca?* current) of other 
cone horizontal cells. Same solutions and flash intensity as in b. Collected 
results (nm = 5) and average (magenta) are shown in the right panel. d, Light 
had no effect on the other voltage-gated currents. Membrane potential 
stepped from —50 to 0 mV for Na’ current (n = 8), to +60 mV for delayed- 
rectifier K* current (n = 7), and to — 100 mV for inward-rectifier K* current 
(n = 8). Same flash intensity as above but for 2-30 s. Other wavelengths were 
also tried (400, 440, 540, 600 and 680 nm), but gave no responses either (not 
shown). K-methanesulphonate pipette solution. e, Light had no effect on 
glutamate- and GABA-induced currents. Same light as in d. Glutamate 
current recorded in Mg” * -free catfish Ringer and K-methanesulphonate 
pipette solution, n = 6; GABA current recorded in normal catfish Ringer and 
KCl pipette solution, n = 8. Both pipette solutions had 20 uM Cd?* to block 
Ca’* current. Viigo —50 mV. Glut, glutamate. 
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modulated specifically the Ca** current. We found the same light 
effect when all methanesulphonate (standard anion in the pipette 
solution and impermeant through Cl” channels) was replaced by 
Cl” to give a Cl” equilibrium potential of ~0 mV, which is evidence 
against Cl” involvement in the light response. Finally, although larger 
than cone horizontal cells, dissociated rod horizontal cells had a 
smaller Ca?* current® (30 + 11 pA, n= 3) in darkness and showed 
no detectable photosensitivity (data in normal solution not shown, 
but see Fig. 2c). 

Substitution of extracellular Ba?* for Ca** (Methods) increased 
the current through the Ca** channels on cone horizontal cells by 
~80% (from 217 + 41 pA to 400 + 128 pA, n= 5) and slowed its run 
down'*'’. The associated light response was also larger. Interestingly, 
light produced only an increase but no initial decrease in Ba”* cur- 
rent, which lasted many minutes; again, the amplitude but not the 
voltage dependence was affected (Fig. 2a). Even a cone horizontal cell 
that gave only a light-induced decrease in Ca** current in normal 
solution showed a light-induced increase in Ba*~ solution (Fig. 2a). 
Altogether, we observed a light response in Ba*~ solution in 108 out 
of 132 recorded cone horizontal cells, with a ‘threshold’ flash intens- 
ity of ~1 X 10° photons pm * (480 nm) for eliciting any detectable 
response. The dark Ba”* current and its modulation by light was 
blocked by 10uM nifedipine (Fig. 2b, n= 6), suggesting L-type 
Ca** channels were involved and consistent with their presence in 
catfish cone horizontal cells”!*. Again, only cone horizontal cells, but 
not rod horizontal cells (even with greater light intensity and at 
different wavelengths), showed light-mediated modulation of the 
Ba’* current (Fig. 2c), although the Ba** current on rod horizontal 
cells was also nifedipine sensitive (Supplementary Fig. 4). 

A light-induced decrease in Ca** current, but not Ba?™ current, 
suggested that a Ca” influx through the Ca”* channel was necessary. 
Indeed, a light-induced current decrease reappeared when Ca” was 
added to the Ba** solution (Fig. 2d, left). Also, the strong Ca** 
buffering in our typical pipette solution (10mM BAPTA and no 
Ca**; see Methods) did not eliminate the light-induced decrease in 
Ca** current, whereas removing all Ca”* buffering in the pipette did 
not make much difference either, although the light-induced Ca** 
current decrease seemed faster and larger (Fig. 2d, right). Thus, Ca** 
might act very close to the Ca”* channel, where it entered the cell, to 
affect a light-sensitive target. The possibility that the light-induced 
current decrease originated instead from the opening of a Ca**- 
activated K* current'® is unlikely, because this is inconsistent with 
the constant form of the I-V relationship in light and darkness as we 
observed. 

We probed the spectral sensitivity of the cone horizontal-cell light 
response. We used only the Ba’* solution because of the faster run- 
down of the Ca** current; thus, we could examine only the light- 
induced current increase. By reiterated pair-wise comparisons 
between two wavelengths in a given cell (Supplementary 
Information and Supplementary Fig. 5), we inferred a spectral- 
effectiveness ordering of 400 < 440 < 480 > 520 > 540 >560> 
600 > 680 nm. Thus, the 2,4. of the underlying pigment is between 
440 and 520 nm. This ordering is tentative because the method in 
principle applies only to bleachable pigments, which nonetheless 
include all known vertebrate pigments except possibly for melan- 
opsin (see below), the bleachability of which remains unclear'’. 

Some teleosts have been reported to express melanopsin or verte- 
brate ancient (VA) opsin, or both, in their horizontal cells**'*?. VA 
opsin is a pigment of largely unknown function. By degenerate poly- 
merase chain reaction with reverse transcription (RT-PCR) based on 
known melanopsins and VA opsins (Methods), we isolated from 
catfish retina the complementary DNA clones for two distinct melan- 
opsin genes (catfish OPN4m1 and OPN4mz2, based on sequence com- 
parison with other fish melanopsins”’) and two alternatively spliced 
cDNAs ofa VA opsin gene (catfish VA and VAL, with VAL having a 
longer and different C terminus’’) (see Methods, Supplementary 
Fig. 6 and Supplementary Tables 1 and 2). In situ hybridization 
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Figure 2 | Characterization of catfish cone horizontal-cell light response. 
a, Light only increased current in Ba~" solution. Time gap ~2 min, identical 
flashes, K-methanesulphonate pipette solution. b, Ba~* current and its light- 
induced increase were both sensitive to nifedipine (Nif, 10 1M). Time gap 
~5 min. c, Ba’* current of cone horizontal cells (bottom), but not rod 
horizontal cells (top), was light sensitive. Left shows sample cell (black, 
before light; red, ~3 min after light). n = 5 (top) and n = 4 (bottom) in 
group data. Simplified Ba-Ringer and Cs-methanesulphonate pipette 


labelling on flat-mount retina (Fig. 3a, left) indicated that, based on 
its depth of location in the retinal thickness and other landmarks, the 
OPN4m1- and OPN4m2-positive cells were situated near the outer 
margin of the inner nuclear layer, where horizontal cells are situated. 
The same labelling in retinal cross-sections supported this conclusion 
(Fig. 3a, right, and Supplementary Fig. 7). OPN4m1 was found also in 
cells near the inner margin of the inner nuclear layer, probably 
amacrine cells and/or displaced ganglion cells. This distribution 
pattern of melanopsin mRNA was found in most other fishes 
examined**. VA/VAL opsin transcripts were found predominantly 
in the ganglion cell layer and less in the inner nuclear layer (Fig. 3b 
and Supplementary Fig. 7), therefore less likely in horizontal 
cells®'*"?. Thus, the intrinsic light response of catfish cone horizontal 
cells is possibly triggered by melanopsin, although the involvement of 
VA and VAL cannot be completely excluded. There are other putative 
pigments in the retina, including teleost multiple tissue (TMT) 
opsin’' and neuropsin (OPN5)”, but their exact locations remain 
unknown. 

We also examined goldfish retina. Dissociated horizontal cells 
from goldfish are identifiable by their relatively large size and flat 
somata with stellar processes'* (Fig. 4a). However, it is difficult to 
distinguish these horizontal cells further into subtypes I through IV, 
which have been identified in the whole goldfish retina”. In Ba?* 
solution, the only solution tried, we observed a light-induced, simi- 
larly slow current increase in most horizontal cells recorded (Fig. 4b, 
18 out of 24 cells; by 34 + 20 pA at peak, or 37 + 33% over that in 
darkness, triggered by 5 X 10° photonspm 7s! for 0.5-1.0s at 
480nm). The remaining horizontal cells gave no light responses 
(Fig. 4c). This percentage of photoresponsive goldfish horizontal 
cells was remarkably high, but could have resulted from the 
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solution in b and c. d, Ca** influx was required for light-induced current 
decrease. Left, Ca”* (3 mM) in Ba-Ringer restored light-induced current 
decrease. Collected results (n = 2) and average (magenta) are shown. 
K-methanesulphonate pipette solution. Right, omitting 10 mM BAPTA in 
pipette still gave light-induced current decrease (n = 4; magenta, average). 
K-methanesulphonate pipette solution and Ca’ ‘ -Ringer. In all cases, light 
was 480 nm, 0.8 X 108-1 X 10° photons um? s | for 1-50s. 


dissociation procedure biasing the yield of particular horizontal-cell 
subtypes. 

We have established the presence of intrinsically photosensitive 
horizontal cells in the fish retina. In catfish, these are specifically 
cone horizontal cells, with a flash threshold of perhaps 
~10° photons um~* (480nm), or 10*—10°-fold less sensitive than 
fish cones (see Supplementary Information), not greatly different 
from that of intrinsically photosensitive retinal ganglion cells in 
mouse’’. With this low sensitivity, it makes sense at least that the 
intrinsically photosensitive horizontal cells are associated with cone 
rather than rod circuitry. In compensation, the prolonged response 
(lasting for minutes) should make temporal summation extremely 
effective, especially for sensing steady ambient light. If the intrinsic 
photosensitivity exists all over the cell, the large surface area of hori- 
zontal cells will also capture ambient light quite effectively. The hori- 
zontal-cell light response in this study seems distinct from the 
supposed intrinsic light response previously reported by others® for 
in situ horizontal cells in the roach flat-mount retina. This other 
response’, although having an action-spectrum Amax (~477 nm) dis- 
tinct from those of roach rods and cones, was observed in only 43 out 
of more than 1,000 cells, was found in rod horizontal cells and, 
finally, disappeared in the presence of synaptic blockers, which made 
a definitive conclusion of intrinsic photosensitivity difficult. 

Unlike mammalian intrinsically photosensitive retinal ganglion 
cells, which mediate predominantly non-image-forming visual func- 
tions’, albeit not exclusively”, the intrinsically photosensitive hori- 
zontal cells are almost certainly involved primarily in image vision 
because horizontal cells are principal elements in the retinal circuitry 
generating the antagonistic surround of the bipolar cell’s receptive 
field. With the operating range of horizontal-cell membrane 
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Figure 3 | Melanopsin, but perhaps not VA/VAL opsin, was expressed in 
the horizontal cell layer of catfish retina. a, In situ hybridization of 
OPN4m1 and OPN4m2 mRNA. Left: digoxigenin (DIG) staining of flat- 
mount retina. Small, dark-blue areas surrounding hollow cores (presumably 
the nuclei) indicate stained cell cytoplasm. The right panel shows the same 
experiments on retinal cross-sections with DIG signal, propidium iodide 
(PI) nuclear staining to mark retinal layers, and the two merged as indicated. 
Catfish retinal layers are not very distinctive, with sparse nuclei in the inner 
nuclear layer’. b, In situ hybridization of VA/VAL opsin mRNA. Left: DIG 
staining of flat-mount retina. Dark-blue dots are the stained cells, situated 
close to the retina inner surface with blood vessels (arrows) evident in the 
same focal plane. Right: same experiments on retinal cross-sections. In both 
a and b, cells appear smaller in the flat-mount retina than in retinal cross- 
sections owing to some tissue shrinkage during processing. GCL, Ganglion 
cell layer; INL, inner nuclear layer; ONL, outer nuclear layer. 


potential under in situ conditions being typically —20 to —65 mV, 
the corresponding range for light modulation would be approxi- 
mately —20 to —30mV, within which the Ca** current is active. 
This narrow range means that the effect of light is probably subtle. 
Because of the extremely slow intrinsic horizontal-cell light response, 
ambient light may alter, on a slow timescale, the bipolar-cell recept- 
ive-field surround, such as by modulating the horizontal cells’ 
electrical coupling or synaptic properties. The relationships of intra- 
cellular Ca** concentration to horizontal-cell electrical coupling and 
to horizontal-cell synaptic-transmitter release in fish have been 
studied, but the details are nonetheless unsettled, especially on a slow 
timescale''’***. Many questions remain, such as by what mechanism 
light affects the horizontal-cell Ca** current, what specific function 
this intrinsic photosensitivity serves in the fish retina, and whether 
light has additional (and possibly electrically silent) effects in these 
horizontal cells. What is clear from this work on intrinsically photo- 
sensitive horizontal cells and previous work on intrinsically photo- 
sensitive retinal ganglion cells'’, however, is that the types of 
intrinsically photosensitive retinal neurons can be much more 
complex than previously thought. 
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Figure 4 | Many goldfish horizontal cells also showed light responses. 

a, Dissociated goldfish horizontal cells. Horizontal cells shown in left and 
middle panels responded to light, but the horizontal cell shown in the right 
one did not. b, Light increased Ba?” current in some horizontal cells. Left 
and middle panels show a representative cell (left cell in a). Right panel 
shows collected data (n = 5) and average (magenta). c, Non- 
photoresponsive horizontal cells under the same conditions as in b. Left and 
middle panels show a representative cell (right cell in a). Right panel shows 
collected data (n = 6) and average (magenta). In both b and ¢, the flash was 
5 X 10* photons um ” (480 nm), and simplified Ba-Ringer and 
Cs-methanesulphonate pipette solution were used. 


METHODS SUMMARY 

Retina dissociation and electrophysiology. The fish was killed and the eyes 
removed in dim red light. Subsequent procedures were performed under infra- 
red light. The retina-dissociation procedure for obtaining horizontal cells fol- 
lowed published methods'*”’. After visual identification, a horizontal cell was 
voltage-clamped with whole-cell, patch-clamp recording at 23°C. The access 
resistance (generally 20-40 MQ) was not compensated, but should introduce 
an error of under a few millivolts because the measured Ca”* current was small. 
Liquid-junction potentials were measured and corrected. Voltage ramps (45— 
100mVs_') were used to elicit Ca**/Ba’* currents in most experiments 
(Supplementary Fig. 8). Light stimuli were delivered via an optical fibre to the 
microscope, with strength varied by changing the intensity and/or duration of 
the flash. Perforated-patch recording (with nystatin, amphotericin B or grami- 
cidin) was also attempted, but the membrane under the pipette invariably rup- 
tured to become whole-cell recording in <30 min, as indicated by Lucifer yellow 
entering the cell from the pipette solution. With the light response being extre- 
mely slow, this time duration would allow at most one or two light responses. 
Accordingly, this method was not used. 

Cloning of melanopsin and VA/VAL opsin genes from catfish retina. CDNA 
was synthesized from catfish retina total RNA, and used as a template for nested 
degenerate PCR. Primers were designed based on the conserved regions in 
known melanopsin and VA opsin amino-acid sequences. Conventional 5’- 
and 3'-rapid amplification of cDNA ends (RACE) was performed to obtain 
full-length cDNA sequences. Products were cloned into a commercial vector 
and sequenced. Sequences with more than three identical clones were considered 
true. 

In situ hybridization of melanopsins and VA/VAL opsin mRNA in catfish 
retina. RNA probes were made against cloned catfish OPN4m1 and OPN4m2 
sequences, and against the C termini of catfish VA and VAL opsins. 
Conventional in situ hybridization method for whole-mount embryos was 
adapted for whole-mount staining of catfish retina. In situ hybridization of 
fresh-frozen retinal sections was also performed (see Methods). 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Retina dissociation and electrophysiology. Channel catfish (Ictalurus puncta- 
tus) about 1 ft long were purchased from Osage Catfisheries (Missouri) and 
maintained in 14/10h light/dark cycles (light on 7:00-21:00) at the Johns 
Hopkins School of Medicine animal facility. Experiments were done in daytime 
hours of 12:00-20:00. The procedure for dissociating the catfish retina was 
largely as published’*. Before experiments, a catfish dark-adapted for ~60 min 
was killed in dim-red light with an institution-approved protocol, the eyes were 
excised and the cornea, iris and lens removed. The posterior eyecup was incu- 
bated in catfish low-Ca** dissociation Ringer (123 mM NaCl, 0.3mM CaCl, 
4mM KCl, 2mM MgCh, 15 mM glucose, 10 mM HEPES, 1 mM sodium pyru- 
vate, pH 7.4) with 0.2-0.3 mg ml ' hyaluronidase type I-S (Sigma-Aldrich) for 
4 min, then in the same base dissociation solution with 20-28 U ml! papain 
(Fluka) for 4-6 min. Afterwards, the retina was peeled from the eyecup, cut into 
six to eight pieces and incubated further in base dissociation solution with 
4-20 Um! papain for 3-12 min. The retina pieces were rinsed four times in 
base dissociation Ringer with 0.1 mgml~! bovine serum albumin and trans- 
ferred into normal catfish Ringer (118mM NaCl, 3mM CaCl, 4mM KCl, 
2mM MgCl, 15mM glucose, 10 mM HEPES, pH 7.4) and kept on ice until 
use. When needed, horizontal cells were dissociated from pieces of tissue by 
mechanical trituration with a fire-polished Pasteur pipette. 

The dissociation procedure for goldfish (Carassius auratus) retina also fol- 
lowed a published procedure”’. Goldfish 4-6-inch long were purchased from 
local pet stores and maintained in 12/12 h light/dark cycles. Experimental pro- 
cedures were largely similar to above. Goldfish low-Ca’* dissociation Ringer 
contained 117mM NaCl, 0.5mM CaCl, 2.5mM KCl, 5mM MgCl, 16mM 
glucose, 10mM HEPES, 1mM sodium pyruvate, 1mM NaHCO;3, 1mM 
NaH,PO,, pH 7.3. Eyecups were prepared, and the retina was peeled off and 
incubated in hyaluronidase type I-S solution (Sigma-Aldrich, 1 mg ml! in base 
dissociation Ringer) for 10min. The retina was rinsed in base dissociation 
Ringer, cut into four to six pieces, transferred into papain solution (Fluka, 
1mgml ' in base dissociation Ringer) for 30 min, and rinsed four times in base 
dissociation Ringer containing 0.1 mg ml! bovine serum albumin. The retinal 
pieces were transferred to normal goldfish Ringer (122 mM NaCl, 2.5 mM KCl, 
2.5mM CaCl,, 1mM MgCl, 10 mM HEPES, 16 mM glucose, pH 7.3) and kept 
on ice until used as above. 

To pharmacologically isolate each of the four known voltage-gated ion cur- 
rents in catfish cone horizontal cells, the following blockers or their combinations 
were added, or equimolar-substituted, to the normal fish Ringer”'*. Tetrodotoxin 
(10 uM) was added to block the voltage-gated Na‘ current. Cd?* (20 pM) was 
added to block the voltage-gated Ca”* current. Tetraethylammonium ion (TEA- 
Cl, 20 mM) and 4-aminopyridine (10 mM) were equimolar-substituted for NaCl 
to block the delayed-rectifier K* current, and 1 mM Ba?* was added to block the 
inward-rectifier K* current. For example, the Ringer (‘Ca-Ringer’) for isolating 
the Ca”* current contained 94 mM NaCl, 3 mM CaCl, 4mM KCl, 2mM MgCh, 
15mM glucose, 10 mM HEPES, 10M tetrodotoxin, 20mM TEA-Cl, 10mM 
4-aminopyridine, 1 mM BaCl, pH 7.4. ‘Ba-Ringer’ had 3mM Ba?" in place of 
the 3mM Ca°", in addition to the 1 mM Ba’" to block the inward-rectifier K* 
current. The substitution of Ba** for Ca** shifted the L-V relationship by a few 
millivolts in the hyperpolarized direction’® (see Fig. 2a). Normal internal (that is, 
pipette-) solution contained 105mM K-methanesulphonate, 10mM BAPTA, 
10mM HEPES, 10mM NaCl, 4mM ATP-Mg, 0.3mM GTP-Na, pH7.3 
(‘K-methanesulphonate-based pipette solution’). In later experiments, to record 
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a stable Ba~* current for a longer period, we replaced all external Na* and K* 
with TEA*: 4mM BaCl,, 120 mM TEA-Cl, 3mM MgCl, 15 mM glucose, 10 mM 
HEPES, pH 7.4 (referred to as ‘simplified Ba-Ringer’), and substituted Cs* for 
K* in the pipette solution: 95mM Cs-methanesulphonate, 10mM BAPTA, 
10mM HEPES, 20mM TEA-Cl, 4mM ATP-Mg, 0.3mM GTP-Na, pH7.3 
(‘Cs-methanesulphonate-based_ pipette solution’)'’. Pipette solution without 
Ca** buffer contained: 120 mM K-methanesulphonate, 10 mM HEPES, 10 mM 
NaCl, 4mM ATP-Mg, 0.3mM GTP-Na, pH 7.3. ‘KCl-based pipette solution’ 
contained 100mM KCl, 10mM BAPTA, 10mM HEPES, 10mM NaCl, 4mM 
ATP-Mg, 0.3 mM GTP-Na, pH 7.3. All solutions had 260-265 mOsm. 

Membrane currents elicited with voltage ramps gave similar results to those 
with voltage steps (Supplementary Fig. 8). The Ca** current was measured from 
recordings as shown in Supplementary Fig. 9. 

The light stimulus was generally a 40-j1m-diameter spot centred on the soma 

of the cell. The light-spot size was chosen to illuminate enough membrane area 
yet to avoid other dissociated cells and debris. Larger light spots (with twice the 
diameter) were occasionally used, but gave qualitatively similar results. Bigger 
light spots (200 ttm diameter) were used for catfish rod horizontal cells because 
these cells were larger. The microscope did not allow changing the light-spot size 
or moving it to different parts of the cell during the experiment without dis- 
turbing the recording. 
Cloning melanopsins and VA/VAL opsin from catfish retina. Total RNA was 
isolated from catfish retina using TRIzol Reagent (Invitrogen), and reverse- 
transcribed into cDNA with SuperScript II reverse transcriptase (Invitrogen). 
Degenerate PCR primers were manually designed from a highly conserved 
region of 8-10 amino acids. Degeneracy was generally kept below 100-fold 
and idenosine was used when necessary in the 5’ region of the primers. A second 
set of primers were designed using the program CODEHOP (consensus-degen- 
erate hybrid oligonucleotide primers: http://blocks.fhcrc.org/help/CODEHOP/ 
CODEHOP_help.html). Nested degenerate PCR was performed with Taq poly- 
merase. Phusion DNA polymerase (Finnzymes) was used for PCR in 5’- and 
3'-RACE. 

The DNA sequences of degenerate PCR products were translated into amino- 
acid sequences and aligned with those of known melanopsins and VA opsins 
using Vector NTI (Invitrogen). The translated sequences were also used as tem- 
plates to search for similar sequences in the genome database using NCBI Blast. 

The accession numbers for catfish OPN4m1 and OPN4m2 are, respectively, 

FJ839437 and FJ839438. The accession numbers for VA and VAL opsins are 
FJ839435 and FJ839436, respectively. 
In situ hybridization of melanopsins and VA/VAL opsin mRNAs in catfish 
retina. T7 or Sp6 RNA polymerase (New England Biolabs) was used to synthesize 
both sense and antisense RNA probes tagged with digoxigenin. For whole- 
mount staining, retinas were fixed in 4% paraformaldehyde, digested by 
proteinase K, post-fixed, washed with PBS, pre-hybridized and then hybridized 
overnight at 68 °C, followed by post-hybridization steps. Anti-digoxigenin alka- 
line phosphatase was used for colour development. Cross-sections (25 ,1m thick) 
of fresh-frozen eyecups were made using a cryostat and processed to examine 
more clearly in which layer(s) the DIG signal was located. 


30. Linn, C. L. & Gafka, A. C. Activation of metabotropic glutamate receptors 
modulates the voltage-gated sustained calcium current in a teleost horizontal cell. 
J. Neurophysiol. 81, 425-434 (1999). 
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Gain-of-function of mutated C-CBL tumour 
suppressor in myeloid neoplasms 


Masashi Sanada’”*, Takahiro Suzuki’*, Lee-Yung Shih®*, Makoto Otsu’, Motohiro Kato’, Satoshi Yamazaki®, 
Azusa Tamura’, Hiroaki Honda’’, Mamiko Sakata-Yanagimoto’”, Keiki Kumano®, Hideaki Oda’’, 

Tetsuya Yamagata’, Junko Takita’**, Noriko Gotoh'®, Kumi Nakazaki’*, Norihiko Kawamata’”, 

Masafumi Onodera’®, Masaharu Nobuyoshi’, Yasuhide Hayashi’’, Hiroshi Harada'®, Mineo Kurokawa™, 
Shigeru Chiba’, Hiraku Mori'®, Keiya Ozawa’, Mitsuhiro Omine'®, Hisamaru Hirai’, Hiromitsu Nakauchi®”, 


H. Phillip Koeffler’? & Seishi Ogawa’? 


Acquired uniparental disomy (aUPD) is a common feature of cancer 
genomes, leading to loss of heterozygosity. aUPD is associated not 
only with loss-of-function mutations of tumour suppressor genes’, 
but also with gain-of-function mutations of proto-oncogenes”. Here 
we show unique gain-of-function mutations of the C-CBL (also 
known as CBL) tumour suppressor that are tightly associated with 
aUPD of the 11q arm in myeloid neoplasms showing myeloproli- 
ferative features. The C-CBL proto-oncogene, a cellular homologue 
of v-Cbl, encodes an E3 ubiquitin ligase and negatively regulates 
signal transduction of tyrosine kinases**. Homozygous C-CBL 
mutations were found in most 11q-aUPD-positive myeloid malig- 
nancies. Although the C-CBL mutations were oncogenic in NIH3T3 
cells, c-Cbl was shown to functionally and genetically act as a 
tumour suppressor. C-CBL mutants did not have E3 ubiquitin ligase 
activity, but inhibited that of wild-type C-CBL and CBL-B (also 
known as CBLB), leading to prolonged activation of tyrosine kinases 
after cytokine stimulation. c-Cbl-’" haematopoietic stem/progeni- 
tor cells (HSPCs) showed enhanced sensitivity to a variety of cyto- 
kines compared to c-Cbl*’* HSPCs, and transduction of C-CBL 
mutants into c-Cbl-‘" HSPCs further augmented their sensitivities 
to a broader spectrum of cytokines, including stem-cell factor (SCF, 
also known as KITLG), thrombopoietin (TPO, also known as 
THPO), IL3 and FLT3 ligand (FLT3LG), indicating the presence 
of a gain-of-function that could not be attributed to a simple loss- 
of-function. The gain-of-function effects of C-CBL mutants on cyto- 
kine sensitivity of HSPCs largely disappeared in a c-CbI*’* back- 
ground or by co-transduction of wild-type C-CBL, which suggests 
the pathogenic importance of loss of wild-type C-CBL alleles found 
in most cases of C-CBL-mutated myeloid neoplasms. Our findings 
provide a new insight into a role of gain-of-function mutations of a 
tumour suppressor associated with aUPD in the pathogenesis of 
some myeloid cancer subsets. 

Myelodysplastic syndromes (MDS) are heterogeneous groups of 
blood cancers originating from haematopoietic precursors. They are 


characterized by deregulated haematopoiesis showing a high pro- 
pensity to acute myeloid leukaemia (AML)’. Some MDS cases have 
overlapping clinico-pathological features with myeloproliferative 
disorders, and are now classified into myelodysplasia/myeloprolifera- 
tive neoplasms (MDS/MPN) by the World Health Organization 
(WHO) classification*. To obtain a comprehensive profile of allelic 
imbalances in these myeloid neoplasms, we performed allele-specific 
copy number analyses of bone marrow samples obtained from 222 
patients with MDS, MDS/MPN, or other related myeloid neoplasms 
(Supplementary Tables 1 and 2) using high-density single nucleo- 
tide polymorphism (SNP) arrays combined with CNAG/AsCNAR 
software”"®. 

Genomic profiles of MDS and MDS/MPN showed characteristic 
unbalanced genetic changes, as reported in previous cytogenetic 
studies"' (Supplementary Fig. 1a); however, they were detected more 
sensitively by SNP array analyses (Supplementary Table 3). aUPD 
was detected in 70 samples (31.5%) on the basis of the allele-specific 
copy number analyses, which substantially exceeded the detection 
rate obtained using a SNP call-based detection algorithm (20.7%) 
(Supplementary Figs 2 and 4, and Supplementary Tables 4 and 5). 
Long stretches of homozygous SNP calls caused by shared identical- 
by-descent alleles in parents were empirically predicted and excluded 
(Supplementary Fig. 3). aUPDs were more common in MDS/MPN 
than in MDS. They preferentially affected several chromosomal arms 
(1p, 1q, 4q, 7q, 11p, 11q, 14q, 17p and 21q) in distinct subsets of 
patients, and frequently associated with mutated oncogenes and 
tumour suppressor genes (Supplementary Figs 1b and 5). Among 
these, the most common aUPDs were those involving 11q 
(n= 17), which defined a unique subset of myeloid neoplasms that 
were clinically characterized by frequent diagnosis of chronic mye- 
lomonocytic leukaemia (CMML) with normal karyotypes (13 cases) 
(Fig. la and Supplementary Table 6). We identified a minimum 
overlapping aUPD segment of approximately 1.4 megabases (Mb) 
in 11q, which contained a mutated C-CBL proto-oncogene (Fig. 1b). 
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Supplementary Tables 6 and 7), as also indicated in a recent report’. 
Thus, C-CBL was thought to be the major, if not the only, target of 
11lq-aUPD in myeloid neoplasms. Two different C-CBL mutations 
co-existed in three cases (Supplementary Fig. 6b). Somatic origins of 
the mutations were confirmed in three evaluable cases (Sup- 
plementary Fig. 6c). 

In most cases, C-CBL mutations were missense, involving the evo- 
lutionarily conserved amino acids within the linker-RING finger 
domain that is central to the E3 ubiquitin ligase activity’” (Fig. 1c). 
Another case with a predominant Cys384Tyr mutation also con- 
tained a nonsense mutation (Arg343X) in a minor subclone, which 
resulted in a v-Cbl-like truncated protein (Supplementary Fig. 6b). In 
the remaining two cases, mutations led to amino acid deletions 
(A369—371 and A368—382) involving the highly conserved o-helix 
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Figure 1| Common UPD on the 11q arm and C-CBL mutations in myeloid 
neoplasms. a, Copy number profiles of 17 cases with myeloid neoplasms 
showing 11qUPD. Regions of copy number (CN) gains, losses and aUPD are 
depicted in different colours. Histologies are shown by coloured boxes. 
Asterisks denote C-CBL-mutated cases. Ch, chromosome; RAEB, refractory 
anaemia with excess blasts. b, CNAG output for MDS.U35. Total copy 
number (tCN) and allele-specific copy number (asCN) plots show a focal 
copy number gain spanning a 1.4-Mb segment within 3 Mb of an 11q-aUPD 
region (left), which contained mutated C-CBL in MDS.U35 (right). 

c, Alignments of amino acid sequences for human CBL family proteins and 
their homologues in Caenorhabditis elegans (SLI-1) and Drosophila (D-Cbl). 
Amino acid numbering is on the basis of human C-CBL. Conserved amino 
acids are highlighted. Positions of mutated amino acids are indicated by 
asterisks. Heterozygous mutations are shown in red. d, Mutated amino acid 
positions in the three-dimensional structure of a human C-CBL-UBE2L3 
complex. TKB, tyrosine kinase binding domain. 


C-CBLis the cellular homologue of the v-Cbl transforming gene of 
the Cas NS-1 murine leukaemia virus””’. It was recently found to be 
mutated in human AML cases'*"°. Together with its close homo- 
logue, CBL-B, C-CBL is thought to be involved in the negative modu- 
lation of tyrosine kinase signalling, primarily through their E3 
ubiquitin ligase activity that is responsible for the downregulation 
of activated tyrosine kinases*>. By sequencing all C-CBL exons in all 
222 samples, we found C-CBL mutations in 15 of the 17 cases with 
11q-aUPD, whereas only 3 out of 205 cases without 11q-aUPD had 
C-CBL mutations, showing a strong association of C-CBL mutations 
with llq-aUPD (P= 1.46 Xx 107'%) (Supplementary Fig. 6 and 


(aL) of the linker domain and the first loop of the RING finger. 
According to the published crystal structure of C-CBL'’, most of 
the mutated or deleted amino acids were positioned on the interface 
for the binding to the E2 enzyme (Fig. 1d), making contact with 
either the tyrosine kinase binding domain (Tyr 368 and Tyr 371) or 
E2 ubiquitin-conjugating enzymes (Ile 383, Cys 404 and Phe 418). 
Especially, all seven linker-domain mutations selectively involved 
just three amino acids (Gln 367, Tyr 368 and Tyr 371) within the 
conserved o&L helix (Fig. 1d). Mutations were clearly homozygous 
in nine cases, and the apparently heterozygous chromatograms in 
the other six cases could also be compatible with homozygous 
mutations affecting the aUPD-positive tumour clones, given the 
presence of substantial normal cell components within these 
samples. Mutations in the remaining three cases were considered 
to be heterozygous. About half of the C-CBL-mutated cases carried 
coexisting mutations of RUNX1 (four cases), TP53 (one case), FLT3 
internal tandem duplication (1 case) or JAK2 (3 cases). NRAS and 
KRAS mutations were prevalent among CMML (15.1%) but 
occurred within discrete clusters from C-CBL-mutated cases 
(Supplementary Tables 2 and 6 and Supplementary Fig. 5). The 
mutation status of C-CBL did not substantially affect the clinical 
outcome (Supplementary Fig. 7). 

All tested C-CBL mutants induced clear oncogenic phenotypes in 
NIH3T3 fibroblasts, as demonstrated by enhanced colony formation 
in soft agar and tumour generation in nude mice (Supplementary 
Fig. 8). Transformed NIH3T3 cells showed PI3 kinase-dependent 
activation of Akt and the transformed phenotype was reverted by 
treatment with the PI3 kinase inhibitor Ly294002 (Supplementary 
Fig. 9). When introduced into Lin~ Scal* c-Kit* (LSK) HSPCs, 
C-CBL mutants (C-CBL(G1n367Pro) and C-CBL(Tyr37 Ser) ), as well 
as a mouse lymphoma-derived oncogenic mutant (C-CBL(70Z)), sig- 
nificantly promoted the replating capacity of these progenitors 
(Fig. 2a). Because c-Cbl negatively modulates tyrosine kinase signal- 
ling, and all C-CBL mutations, including those previously 
reported'*”°, affected the critical domains for its enzymatic activity 
involved in this modulation, C-CBL was postulated to have a tumour 
suppressor function; loss-of-function could be a mechanism for the 
oncogenicity of these C-CBL mutants*». To assess this possibility and 
to clarify further the role of C-CBL mutations in the pathogenesis of 
myeloid neoplasms, we generated c-Cbl ‘~ mice and examined their 
haematological phenotypes (Supplementary Fig. 10). 

In agreement with previous reports'*°, c-Cbl‘~ mice exhibited 
splenomegaly and an augmented haematopoietic progenitor pool, as 
was evident from the increased colony formation of bone marrow 
cells in methylcellulose culture and higher numbers of LSK and 
CD34-negative LSK cells in bone marrow and/or spleen compared 
to their wild-type littermates (Fig. 2b—d and Supplementary Fig. 11). 
Furthermore, when introduced into a BCR-ABL transgenic back- 
ground”!, the c-Cbl '~ allele accelerated blastic crisis depending on 
the allele dosage (Fig. 2e, f). These observations supported the notion 
that wild-type C-CBL has tumour suppressor functions, whereas 
‘mutant’ C-CBL acts as an oncogene; C-CBL can therefore be both 
a proto-oncogene and a tumour suppressor gene. 
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Figure 2 | Tumour-suppressor functions of wild-type C-CBL. a, Prolonged 
replating capacity of LSK cells transduced with mutant C-CBL (C- 
CBL(G1n367Pro) and C-CBL(Tyr371Ser)), compared to mock- or wild-type 
C-CBL-transduced cells. Replating capacity in methylcellulose culture is 
shown as mean colony number (and s.d.) per 1,000 replating cells at 
indicated times of replating. p, passage. b, Increased spleen mass in c-Cbl/~ 
mice compared to c-Cbl*/* mice (mean spleen weight and s.d.). ¢, Mean 
number of total LSK (left) and CD34-negative LSK (right) cells (plus s.d.) in 
bone marrow (BM) and/or spleen in c-CbI*'* (blue columns) and c-Cbl/~ 
mice (red columns). Bone marrow cells from bilateral tibias and femurs were 
counted for each mouse. d, Augmented colony-forming potential of bone 
marrow cells from c-Cbl-‘~ mice (mean colony number and s.d. per 5,000 
bone marrow cells). CFU, colony-forming units. e, Kaplan—Meier survival 
curves of c-Cbl*/*, c-Cbl*’~ and c-Cbl ’ mice carrying a BCR-ABL 
transgene, showing acceleration of blastic crisis in c-Cbl*’~ and c-Cbl “~ 
mice. f, Wright—Giemsa staining of an enlarged lymph node in a Ber-Abl* 
c-Cbl ‘~ mouse during blastic crisis shows massive infiltrates of immature 
leukaemic blasts. Original magnification, < 600. 


Mouse LSK HSPCs expressed two Cbl family member proteins: 
wild-type c-Cbl and Cbl-b (Supplementary Fig. 12)”. When trans- 
duced into NIH3T3 cells stably expressing human epidermal growth 
factor receptor (EGFR), both Cbl proteins enhanced ubiquitination 
of EGFR after EGF stimulation, which was suppressed by coexpres- 
sion of the C-CBL mutants (Fig. 3a, b). In haematopoietic cells, over- 
expression of wild-type C-CBL enhanced _ ligand-induced 
ubiquitination of a variety of tyrosine kinases, including c-KIT, 
FLT3 and JAK2. In contrast, C-CBL mutants not only showed com- 
promised enzymatic activity, but also inhibited the ubiquitinating 
activities in these haematopoietic cells (Fig. 3c), leading to prolonged 
tyrosine kinase activation after ligand stimulation (Fig. 3d). 
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Figure 3 | Inhibitory actions of C-CBL mutants on wild-type C-CBL. 

a, b, Flag-tagged wild-type C-CBL (a) or CBL-B (b) were transfected into 
NIH3T3 cells stably transduced with human EGER plus indicated HA- 
tagged C-CBL mutants. Anti-ubiquitin blots of immunoprecipitated EGFR 
after EGF stimulation show the inhibitory actions of the C-CBL mutants on 
ubiquitinating activity of C-CBL (a) and CBL-B (b). Bottom panels are anti- 
HA and anti-Flag blots of total cell lysates. c, Effects of wild-type and mutant 
C-CBL on cytokine-induced ubiquitination of c-KIT, JAK2 and FLT3 in 
haematopoietic cells V3MC, BaF3 co-transduced with human 
erythropoietin receptor (EPOR) and JAK2 (BaF3/EPOR/JAK2), and FLT3- 
transduced 32D (32D/FLT3), respectively. Each cell line was further 
transduced with indicated C-CBL mutants, and ubiquitination of 
immunoprecipitated kinases was detected by anti-ubiquitin blots at 1 min 
after stimulation with SCF, EPO and FLT3LG. Anti-kinase blots of the 
precipitated kinases are shown below each panel. d, Kinase phosphorylation 
was examined at indicated time points (shown in minutes) after ligand 
stimulation using immunoblot analyses of total cell lysates using antibodies 
to phosphorylated (p-) EGFR, c-KIT, JAK2 and FLT3 in which anti-c- 
tubulin or anti-GAPDH blots are provided as a control. 


Because tyrosine kinase signalling is central to cytokine responses 
in haematopoietic cells and its deregulation is a common feature of 
myeloproliferative disorders”, we next examined the effects of 
C-CBL mutations (C-CBL(GIn367Pro) and C-CBL(Tyr371Ser)) 
and the loss of wild-type C-CBL alleles on the responses of LSK 
HSPCs to various cytokines. In serum-free conditions, c-Cbl-'~ 
LSK cells showed a modestly enhanced proliferative response to a 
variety of cytokines, including SCF, IL3 and TPO, compared to 
c-Cbl*’* cells (mock columns in Fig. 4a). However, the enhanced 
response in c-Cbl '~ cells was markedly augmented and extended to 
a broader spectrum of cytokines, including FLT3 ligand by the trans- 
duction of C-CBL mutants. Of note, the effect of C-CBL mutant 
transduction was not remarkable in c-CbI*/* LSK cells except for 
the response to SCF, which was clearly enhanced by C-CBL mutants 
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Figure 4 | Gain-of-function of mutant C-CBL augmented by loss of wild- 
type C-CBL. a, c-Cbl*’* and c-Cbl '~ LSK cells were transfected with 
various C-CBL internal ribosome entry site (IRES)/green fluorescent protein 
(GFP) constructs, and 50 GFP-positive cells were sorted for serum-free 
culture containing indicated concentrations of SCF, IL3, TPO and FLT3LG. 
Mean cell numbers (plus s.e.m.) on day 5 are plotted. b, c-Cbl”'~ LSK cells 
were co-transduced with C-CBL(GIn367Pro)-IRES-EGFP (C-CBL(Q367P)/ 
EGFP) and mock-IRES-Kusabira-Orange (mock/KO) or wild-type C-CBL- 
IRES-Kusabira-Orange (C-CBL(WT)/KO), and 50 GFP/KO double-positive 


even witha c-Cbl*/* background (Fig. 4a and Supplementary Fig. 13). 
To clarify further the effect of wild-type C-CBL on C-CBL mutants, 
both wild-type C-CBL and C-CBL mutants were co-transduced into 
c-Cbl '~ LSK cells, and their effects on the response to SCF were 
examined. As shown in Fig. 4b, the hyperproliferative response 
induced by C-CBL mutants was almost completely abolished by the 
co-transduction of wild-type C-CBL, suggesting the pathogenic 
importance of loss of wild-type C-CBL alleles found in most 
C-CBL-mutated cases. LSK cells transduced with C-CBL mutants also 
showed enhanced activation of the STAT5 and Akt pathways on cyto- 
kine stimulation (SCF and TPO), which was more pronounced in 
c-Cbl ‘~ than c-Cbl*'* LSK cells (Fig. 4c and Supplementary Fig. 14). 

The modest enhancement of sensitivity to cytokines found in 
c-Cbl ‘~ LSK cells was a consequence of loss of C-CBL functions. 
In contrast, the hypersensitive response of mutant-transduced 
c-Cbl‘~ LSK cells to a broad spectrum of cytokines represents 
gain-of-function of the mutants that could not be ascribed to a 
simple loss of C-CBL functions, which was also predicted from the 
strong association of C-CBL mutations with 11q-aUPD by analogy to 
the gain-of-function JAK2 mutations associated with 9p-aUPD in 
polycythemia vera’. The gain-of-function of C-CBL mutants became 


cells were sorted into each well for cell proliferation assays in serum-free 
culture containing 10 ng ml ' SCF. Mean cell numbers on day 5 (plus s.e.m., 
n = 5) are plotted. c, Ten thousand c-Cbl*!* and c-Cbl”'~ LSK cells 
transduced with various C-CBL constructs were stimulated with 10 ng ml! 
SCF and 10ngml ' TPO for 15 min. Total cell lysates were analysed by 
immunoblotting, using antibodies to STAT5, Akt and their phosphorylated 
forms. The intensities of phosphorylated proteins relative to total STAT5 
(top panel) and Akt (bottom panel) are plotted. NC indicates the mean 
background signal obtained with nonspecific IgG. 


more evident under a c-Cbl ‘~ background. The hypersensitive res- 
ponse to cytokines induced by mutant C-CBL under the c-Cbl/~ 
background was largely offset by the presence of the wild-type c-Cbl 
allele or by the transduction of the wild-type C-CBL gene, suggesting 
that the gain-of-function could be closely related to loss of C-CBL- 
like functions, probably by inhibition of Cbl-b. Supporting this view 
is a previous report that c-Cbl/Cbl-b double knockout T cells showed 
more profound impairments in the downregulation of the T-cell 
receptor (TCR), more sustained TCR signalling, and more vigorous 
proliferation, than c-Cbl or Cbl-b single knockout T cells after anti- 
CD3 (also known as CD3e) stimulation”. This is analogous to the 
gain-of-function found in some TP53 mutants, which has been 
explained by functional inhibition of two TP53 homologues, TP73 
and TP63 (refs 25, 26). Of note, TP53 was also originally isolated as an 
oncogene through its mutated forms’. The Cbl-b inhibition-based 
gain-of-function model could be tested directly by comparing the 
behaviour of c-Cbl/CbI-b double knockout LSK cells with that of LSK 
cells carrying homozygously knocked-in mutant C-CBL alleles. On 
the other hand, there remains a possibility that the gain-of-function 
could be mediated by a mechanism other than the simple inhibition 
of the homologue, because C-CBL mutants retained several motifs 
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that interacted with numerous signal-transducing molecules. 
Furthermore, considering the ubiquitous expression of CBL 
proteins, it would be of interest to explore the possible involvement 
of mutations in all CBL family members in other human cancers. 


METHODS SUMMARY 


Genomic DNA from 222 bone marrow samples with myeloid neoplasms were 
analysed using GeneChip SNP-genotyping microarrays (Affymetrix GeneChip) 
as described”*. Allelic imbalances were detected from the allele-specific copy 
numbers calculated using CNAG/AsCNAR software (http://www.genome. 
umin.jp)”'®. C-CBL mutations were examined by sequencing PCR-amplified 
genomic DNA. For functional assays, haemagglutinin (HA)- or Flag-tagged 
complementary DNAs of wild-type and mutant C-CBL were generated by in 
vitro mutagenesis, constructed into a MSCV-based retroviral vector, 
pGCDNsamIRESGFP or pGCDNsamIRESKO, and used for retrovirus- 
mediated gene transfer. For the evaluation of oncogenicity of C-CBL mutants, 
NIH3T3 cells were transfected with various C-CBL constructs and used for 
colony assays in soft agar and tumour formation assays in nude mice. c-Cbl- 
deficient mice were generated using a conventional strategy of gene-targeting 
and crossed with BCR-ABL transgenic mice to evaluate the effect of the c-Cbl” 
allele on the acceleration of blastic crisis. LSK cells sorted from c-CbI'/* and 
c-Cbl /~ mice were transduced with various C-CBL constructs. Their responses 
to cytokines were evaluated by cell proliferation assays, followed by immunoblot 
analyses of c-KIT, FLT3 and JAK2, as well as their downstream signalling mole- 
cules. The effects of C-CBL mutant expression on the ubiquitination of EGFR, 
c-KIT, FLT3 and JAK2 were examined by transducing C-CBL mutants into 
relevant cells, followed by anti-ubiquitin blots of the immunoprecipitated 
kinases after ligand stimulation. Functional competition of C-CBL mutants 
with wild-type C-CBL was assessed by cell proliferation assays of LSK cells co- 
transduced with both wild-type and mutant C-CBL genes. This study was 
approved by the ethics boards of the University of Tokyo, Chang Gung 
Memorial Hospital and Showa University. Antibodies and primers used in this 
study are listed in Supplementary Tables 8 and 9. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Genome-wide analysis of allelic imbalances in primary myeloid neoplasms. 
Bone marrow specimens were obtained from 222 patients diagnosed with myeloid 
neoplasms according to the WHO classification (Supplementary Tables 1 and 2). 
High molecular weight genomic DNA was extracted and used for microarray 
analysis using Affymetrix GeneChip 50K Xbal, HindIII or 250K Nspl, according 
to the manufacturer’s instructions. Genome-wide detection of allelic imbalances 
was performed using CNAG/AsCNAR software  (http://www.genome. 
umin.jp)*”°. 

Mutation analysis. Mutation analysis was performed by direct sequencing of 
PCR-amplified coding exons of the relevant genes, using an ABI PRISM 3100 
genetic analyser (Applied Biosystems). The target genes, exons and PCR primers 
are listed in Supplementary Table 8. Tandem duplication of the FLT3 gene was 
examined by genomic PCR and sequencing. 

Preparation of high-titre vesicular stomatitis virus glycoprotein (VSV-G)- 
pseudotyped retroviral particles. HA-tagged human C-CBL cDNA was a gift 
from W. Y. Langdon. Nine mutant cDNAs of C-CBL, including eight from patients’ 
specimens and a 70Z mutant corresponding to a mutant isolated from mouse 
lymphoma”, were generated on the basis of this construct, using a QuickChange 
site-directed mutagenesis kit (Stratagene). These were then constructed into the 
retrovirus vectors pGCDNsamIRESGFP and pGCDNsamIRESKO****. Vector 
plasmids were co-transfected with a VSV-G cDNA into 293GP cells (provided by 
R.C. Mulligan) to obtain retrovirus-containing supernatant, which was then trans- 
duced into 293GPG cells to establish stable cell lines capable of producing VSV-G- 
pseudotyped retroviral particles on induction****. The average titre of retrovirus 
stocks prepared from these cell lines routinely exceeded approximately 1-10 X 10” 
inclusion-forming units per ml, as estimated using Jurkat cells. 

Assays for anchorage-independent growth and tumorigenicity in nude mice. 
NIH3T3 cells (the Japan Cell Resource Bank) were stably transduced with wild- 
type and mutant C-CBL by retrovirus-mediated gene transfer. For colony forma- 
tion assays, 1.0 X 10° stable cells for each construct were inoculated in 0.33% top 
agar, and the numbers of colonies >1 mm in diameter were counted 3 weeks 
after inoculation (n= 8). Experiments were repeated four times. For tumour 
formation in nude mice, 1.0 X 10” stable cells were inoculated subcutaneously at 
two sites per mouse. Cells were inoculated at six sites in three mice for each 
construct. 

Purification of LSK HSPCs. LSK HSPCs were purified from bone marrow and 
spleen as described****. Multicolour flow cytometry analysis and cell sorting were 
performed using a MoFlo cell Sorter (Beckman Coulter). The purity of sorted cell 
fractions consistently exceeded 98%. 

Replating assays of bone marrow progenitor cells. Bone marrow LSK cells were 
infected with IRES/GFP-containing retrovirus carrying mock, wild-type C-CBL 
and three C-CBL mutants (C-CBL(Gln367Pro), C-CBL(Tyr371Ser) and 
C-CBL(Cys384Gly)) as well as C-CBL(70Z) on RetroNectin-coated dishes. 
After 48 h infection in culture in StemSpan supplemented with SCF (50 ng ml '; 
Peprotech), TPO (20 ng ml ') and FLT3LG (20 ngml- ') 1.0 X 107 GFP-positive 
cells were inoculated in MethoCult M3231 supplemented with TPO (20 ng ml '), 
IL3 (10 ng ml '), IL6 (10 ng ml '), FLT3LG (10 ng ml!) and SCF (50 ng ml — 
for colony formation. Colony-forming cells were collected 7 days after each inocu- 
lation, from which 1.0 X 10° cells were repeatedly subjected to replating until no 
colonies were produced. Experiments were repeated at the indicated times for 
each C-CBL construct. 

Generation of c-CbI_‘~ mice and evaluation of their tumour-prone phenotype. 
c-Cbl-‘~ mice were generated using a conventional method of gene targeting 
(Supplementary Fig. 10). c-Cbl*’*, c-Cbl*’~ and c-Cbl-‘~ mice were crossed 
with BCR-ABL transgenic mice, and their survival and the development of blastic 
crises were monitored. 

Evaluation of haematopoietic pool size in c-CbI_‘~ mice. LSK and CD34_ LSK 
cells were sorted from bone marrow cells or spleens of c-Cbl '~ mice, and their 
numbers were compared to those in c-CbI*'* littermates (8 week old). 
Approximately 5X 10° bone marrow cells collected from c-CbI*’* and 
c-Cbl ‘~ mice were inoculated into MethoCult M3231 culture supplemented 
with TPO (20 ng ml” '), IL3 (10 ng ml '), IL6 (10 ng ml '), EPO (3 Um!) and 
SCF (50ng ml '). The number of colonies was counted 7 days after culturing. 
In vitro cell proliferation assays. Approximately 6 X 10° LSK cells from 
c-Cbl”'~ mice and their c-CbI*!* littermates (8 week old) were sorted into 
RetroNectin-coated 96-well U-bottom plates containing %-minimum essential 
medium supplemented with 1% fetal bovine serum (FBS), mouse SCF 
(50 ng ml — '), and human TPO (100 ng ml '). After 24h pre-incubation, retro- 
virus supernatant was added to each well at a multiplicity of infection of about 
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10. The plates were incubated for another 24h in the presence of protamine 
sulphate (10 pg ml~'), followed by repeated infection and extended culture for 
2 days in S-Clone SF-O3 medium (Sanko Junyaku) supplemented with 1% BSA, 
50ngml~' SCF and 50ngml_' TPO. On day4, fluorescent-marker-positive 
cells were sorted for subsequent analyses. Cell survival and proliferation of 
LSK cells transduced with different C-CBL constructs were assessed in serum- 
free liquid culture in 96-well U-bottom plates in the presence of various cyto- 
kines. Each well received 50 fluorescent-marker-positive LSK cells, and the cells 
were cultured in S-Clone supplemented with 1% BSA plus SCF, TPO, IL3 or 
FLT3LG at the indicated concentrations. Cell numbers were counted either by 
analysing well images or by flow cytometry using FlowCount beads (Beckman 
Coulter). After 6h serum starvation, 1 X 10* LSK cells transduced with various 
C-CBL constructs were stimulated with SCF (10 ng ml!) and TPO (10 ng ml Ay 
for 15 min. Whole-cell lysates were examined for activation of STAT5 and Akt by 
immunoblots using the respective antibodies. 

Immunoblot analysis of physical interactions between mutant C-CBL and CBL- 
B. Flag-tagged CBL-B or C-CBL was co-transfected into NIH3T3 cells with each of 
three HA-tagged C-CBL mutants (C-CBL(Gln367Pro), C-CBL(Tyr371Ser) and 
C-CBL(70Z)). Total cell lysates of these NIH3T3 cells were immunoprecipitated 
with anti-Flag antibody, followed by immunoblot analysis with anti-HA antibody. 
Detection of ubiquitination and phosphorylation of kinases. After overnight 
serum starvation, NIH3T3 cells stably transduced with human EGFR, and indi- 
cated HA-tagged C-CBL mutants and Flag-tagged wild-type C-CBL were stimu- 
lated with human EGF (10ngml~') for 2min. Cell lysates were immuno- 
precipitated with anti-EGF antibody, followed by immunoblotting using anti- 
ubiquitin antibody. Constructs for wild-type C-CBL and mutant C-CBL were 
stably transduced into a mast cell line, V3MC, FLT3-transduced 32D cells (32D/ 
FLT3) and BaF3 cells transduced with human EPOR and JAK2 (BaF3/EPOR/ 
JAK2) using retrovirus-mediated gene transfer. After overnight serum star- 
vation, the transduced cells were stimulated with 10 ng ml! SCE (V3MC), 
10Uml' EPO (BaF3/EPOR/JAK2) or 10ngml~! FLT3LG (32D/FLT3) 
for lmin. The specific kinases were immunoprecipitated with relevant 
antibodies, and their ubiquitination was detected by immunoblotting with 
anti-ubiquitin antibody. Tyrosine phosphorylation of EGFR, c-KIT, JAK2 and 
FLT3 was examined by immunoblot analyses of total cell lysates after cytokine 
stimulation at indicated time points, using antibodies specifically recognizing 
phosphorylated kinases, anti-p-EGFR, anti-p-c-KIT, anti-p-JAK2 and anti-p- 
FLT3, respectively. Anti-GAPDH or anti-o-tubulin immunoblot was performed 
as a control. Antibodies used in this study are listed in Supplementary Table 9. 
Statistical analysis. Statistical significance of prolonged replating capacity of 
mutant C-CBL-transduced LSK cells was tested by counting the total number 
of dishes that produced colonies, followed by Fisher’s exact test. Survival curves 
of c-Cbl'!*, c-Cbl*'~ and c-Cbl”'~ mice containing the BCR-ABL transgene 
were generated using the Kaplan-Meier method. Overall survivals of C-CBL- 
mutated and non-mutated CMML cases were analysed according to the propor- 
tional hazard model, using STATA software. Statistical differences in survival 
were evaluated using the log-rank test, and statistical differences in 2 X 2 con- 
tingency tables were tested according to Fisher’s exact method. Student’s t-tests 
were used to evaluate the significance of difference in spleen mass, number of 
teense oieic progenitors and colony-forming cells between c-Cbl‘/* and 
c-Cbl" 
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A role for Lin28 in primordial germ-cell development 


and germ-cell malignancy 


Jason A. West’”, Srinivas R. Viswanathan’”, Akiko Yabuuchi’”, Kerianne Cunniff’?, Ayumu Takeuchi’, 
In-Hyun Park'”, Julia E. Sero®, Hao Zhu’”, Antonio Perez-Atayde’, A. Lindsay Frazier’*, M. Azim Surani” 


& George Q. Daley’”*” 


The rarity and inaccessibility of the earliest primordial germ cells 
(PGCs) in the mouse embryo thwart efforts to investigate molecular 
mechanisms of germ-cell specification. stella (also called Dppa3) 
marks the rare founder population of the germ lineage’”. Here we 
differentiate mouse embryonic stem cells carrying a stella transgenic 
reporter into putative PGCs in vitro. The Stella* cells possess a 
transcriptional profile similar to embryo-derived PGCs, and like 
their counterparts in vivo, lose imprints in a time-dependent 
manner. Using inhibitory RNAs to screen candidate genes for effects 
on the development of Stella* cells in vitro, we discovered that 
Lin28, a negative regulator of let-7 microRNA processing**, is 
essential for proper PGC development. Furthermore, we show that 
Blimp1 (also called Prdm1), a let-7 target and a master regulator of 
PGC specification’ °, can rescue the effect of Lin28 deficiency during 
PGC development, thereby establishing a mechanism of action for 
Lin28 during PGC specification. Overexpression of Lin28 promotes 
formation of Stella* cells in vitro and PGCs in chimaeric embryos, 
and is associated with human germ-cell tumours. The differenti- 
ation of putative PGCs from embryonic stem cells in vitro recapi- 
tulates the early stages of gamete development in vivo, and provides 
an accessible system for discovering novel genes involved in germ- 
cell development and malignancy. 

The transcriptional program responsible for specifying PGCs has 
been investigated through molecular analysis of cDNA created from 
single cells'®"', but because transgenic or knockout methods required 
to verify roles in germ-cell formation are cumbersome, only a small 
number of genes has been demonstrated to be essential in germ-cell 
formation, most notably Blimp1 and Prdm14 (refs 8, 9, 12). Building 
on our previous demonstration of germ-cell formation from embry- 
onic stem (ES) cells in vitro’, we have further characterized differenti- 
ation into putative PGCs, and now exploit the system to screen 
candidate genes via RNA interference (RNAi) to discover novel 
regulators of PGC development. For these studies, we used embryoid 
body (EB) differentiation of ES cells carrying a transgenic Stella—GFP 
reporter that specifically marks the germ-cell lineage in vivo’ (Fig. la 
and Supplementary Fig. 1). The Stella-GFP reporter, which is 
expressed in approximately 20% of cells within each ES cell colony, 
was rapidly downregulated during EB differentiation (Fig. la and 
Supplementary Fig. 2a). Reporter gene expression in ES cells reflects 
known Stella expression in the inner cell mass and does not seem to 
represent pre-existing germ cells'*'°. Rather, as shown below, a small 
percentage of Stella* cells arises in punctate structures within the EB, 
which represent putative PGCs (Fig. la and Supplementary Fig. 2)””°. 


During mouse development, loss of genomic imprinting occurs 
solely in the germ line and is a prerequisite for the sex-specific re- 
establishment of imprints during gametogenesis, thus establishing 
loss of imprinting as a unique marker of the germ lineage'®. Given 
the fidelity of the stella reporter in vivo', we predicted that the Stella* 
population derived during ES cell differentiation would demonstrate 
a time-dependent loss of imprints when compared to Stella-negative 
fractions and the parental ES cells. Indeed, Stella* cells purified 
directly from EBs or clonally derived under growth conditions for 
embryonic germ cells (EGCs) showed extensive loss of methylation at 
imprinted loci from day 7 of differentiation onwards (Fig. 1b and 
Supplementary Figs 3 and 4). These EB-derived EGCs displayed 
pluripotency properties similar to their embryo-derived counter- 
parts (Supplementary Fig. 5)”. 

Quantitative gene expression analysis revealed that germ-cell- 
specific genes become highly enriched within the EB-derived Stella* 
cell fraction in a temporal pattern that recapitulates gene activation 
during germ-cell specification in vivo (Fig. 1c—e). Using microarrays, 
we observed that Stella" cells purified from day 7 EBs show a trans- 
criptional profile with significant similarities to embryo-derived PGCs. 
Unsupervised principal component analysis (PCA) of the microarray 
data revealed the close clustering of day 7 Stella” EB-derived cells with 
embryo-derived E10.5 PGCs (Fig. 1f). Additionally, among a set of 178 
genes that displayed at least a twofold change in expression in Stella* 
cells from day 7 EBs (when compared to Stella* ES cells, Stella-negative 
ES cells and Stella-negative cells from day 7 EBs), germ-cell-specific 
transcripts were highly represented in the EB-derived Stella“ cell popu- 
lation (P = 0.0009; Supplementary Fig. 6a, b). Scatter plot representa- 
tions of the microarray data comparing Stella” cells purified from day 
7 EBs versus either Stella-GFP ES cells or embryo-derived E10.5 PGCs 
were created to highlight individual gene expression similarities and 
differences (Supplementary Fig. 6c, d). These microarray data reveal 
that the overall transcriptional profile of EB-derived Stella* cells is 
highly correlated with PGCs. 

We next sought to use this system of in vitro germ-cell specification 
to characterize the loss-of-function phenotypes for a number of can- 
didate genes (1 = 30) identified through our microarray analysis and 
reports of transcriptional profiling of PGCs’°. We assessed the effects 
of gene knockdown on both tissue-nonspecific alkaline phosphatase- 
positive (TNAP*) EGC colony formation and on the loss of imprints 
during differentiation to link candidate gene function to germ lineage 
specification. TNAP staining is a hallmark of PGCs and EGCs. We 
knocked down endogenous expression of each gene within Stella—GFP 
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Massachusetts 02115, USA. “Channing Laboratory, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA. °Wellcome Trust Cancer Research 
UK Gurdon Institute of Cancer and Developmental Biology, University of Cambridge, Tennis Court Road, Cambridge CB2 1QN, UK. °Manton Center for Orphan Disease Research, 
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Figure 1| ES cell differentiation into putative 


+ stella PGCs in vitro is marked by stella expression. 
i Oct4 ; 
<~ Blimp1 a, Immunofluorescent detection of Stella—GFP, 


F-actin and DAPI expression in ES cells and EBs 
at days 4 and 7 (X63 confocal objective). 
b, Bisulphite sequencing at the KYDMR1 
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ES cells by delivering short hairpin RNAs (shRNAs) via lentiviral 
transduction (Supplementary Fig. 7). shRNAs directed against 
Blimp1, an established regulator of PGC development*”, and lacZ 
served as controls. We differentiated ES cells carrying shRNAs for 
9days and assayed Stella-GFP* cells for their ability to form 
TNAP~ EGC colonies after 5 days of retinoic acid treatment. The 
addition of retinoic acid to our selection media after EB differentiation 
served to promote the self-renewal of PGCs and to differentiate poten- 
tial contaminating ES cells. Notably, transgenic Stella-GFP expression 
during EB differentiation was largely unaffected under all conditions, 
including Blimp 1 knockdown (data not shown), whereas endogenous 
Stella is modulated by gene manipulation (see below). This suggests 
that the endogenous stella locus is more tightly regulated than the 
transgene in vitro. Although Blimp1 knockdown would be expected 
to abrogate Stella expression completely, Blimp 1 homozygous knock- 
out mice still form stella-expressing cells, although these cells are rare 
and fail to proliferate and develop properly*'®. Thus, we focused on 
functional aspects of germ-cell formation, including TNAP colony 
formation and loss of genomic imprints. Among the candidate genes 
that we assessed, knockdown of Lin28 demonstrated the most quanti- 
tative reduction in TNAP-positive colony formation (Fig. 2a and 
Supplementary Fig. 8a). Further corroborating the loss of germ cells, 
we verified that knockdown of Lin28 abrogates the ability to derive 
imprint-erased clones after retinoic acid selection of EB-derived 
Stella* cells (Fig. 2b). 

Lin28 selectively blocks the processing of let-7 precursors into the 
corresponding mature microRNA (miRNA) species* *. Although not 
previously suspected as a regulator of PGC formation, we included 
Lin28 in our screen because it was more highly expressed in day 7 EB- 
derived Stella~ cells than in ES cells and EB-derived Stella-negative 
cells (as determined by microarray). Interrogation of a microarray 
data set of embryo-derived single cells from the mouse PGC lineage 
indicated high Lin28 expression in the proximal epiblast, PGC pre- 
cursors and lineage-restricted PGCs, as well as in the posterior meso- 
derm surrounding these cells'®, We evaluated Lin28 protein 
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expression during PGC development in mouse embryos and 
observed high levels of Lin28 staining within Stella” PGCs at E7.5, 
and only diffuse low-level Lin28 staining in surrounding somatic 
cells. We observed diminishing yet persistent expression of Lin28 
within PGCs through E12.5, at which time Lin28-negative PGCs 
become apparent (Fig. 2c). 

To elucidate further the role of Lin28 in the development of germ 
cells, we characterized the effects of modulating Lin28 expression on 
germ-cell-marker gene expression during ES cell differentiation in 
vitro. Quantitative gene expression analysis revealed that Lin28 RNAi 
decreased the expression of multiple germ-cell markers in Stella— 
GFP* cells during EB differentiation, including key genes implicated 
in PGC commitment, such as Blimp1, Prdm14 and endogenous stella 
(Supplementary Fig. 9a—i). Moreover, we confirmed that Lin28 RNAi 
resulted in increased levels of mature let-7 miRNA family members 
by approximately five- to sixfold in Stella* cells (Supplementary Fig. 
9z-b'). Lin28 RNAi markedly reduced TNAP colony formation 
throughout EB differentiation of Stella—GFP ES cells, consistent with 
a reduction in the generation of putative PGCs (Fig. 2d). Three 
individual shRNAs directed against Lin28 resulted in a similar 
phenotype, and ectopic Lin28 expression resistant to shRNA inhibi- 
tion was able to stimulate formation of TNAP* colonies and re- 
expression of germ-cell-specific genes, thereby arguing against off- 
target effects of Lin28 RNAi (Supplementary Fig. 10). 

We next drove ectopic Lin28 expression during EB differentiation 
from a tetracycline-inducible ES cell line (iLin28 ES cells). Addition 
of doxycycline to the media resulted in a pronounced upregulation of 
Lin28 throughout differentiation, and a corresponding suppression 
of let-7 maturation (Supplementary Fig. 9j, q, c’-e’). Induction of 
Lin28 during EB differentiation was accompanied by increased 
expression of multiple germ-cell markers, including stella, Prdm14 
and Texl4 within whole EBs as well as within SSEAI* cells 
(Supplementary Fig. 9j-y). Notably, we found no evidence that 
ectopic Lin28 expression influenced Blimp1 messenger RNA levels 
(Supplementary Fig. 91, s), which could be a result of the reported 
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Figure 2 | Lin28 regulates PGC development. a, The effects of candidate 
gene knockdown on TNAP~ EGC colony formation from day 9 EB-derived 
Stella* cells after in vitro differentiation of ES cells carrying shRNA- 
mediated gene knockdown, as indicated. n = 3. b, Imprint status at the Igf2r 
and Rasgrf1 loci of individual clones derived from day 9 EB-derived Stella* 
cells carrying gene knockdown of either lacZ, Blimp 1 or Lin28. c, Expression 
of Lin28 during embryonic PGC development. By E12.5, multiple Stella~ 
PGCs within the genital ridge are negative for Lin28 (white arrows; X63 
confocal objective). d, Lin28 RNAi prevents TNAP~ EGC colony formation 
during EB differentiation. n = 3. e, Induced Lin28 expression enhances 
TNAP™ EGC colony formation on and after day 7 of EB differentiation 
compared to the uninduced control. n = 3. Dox., doxycycline. All error bars 
represent s.e.m. 


post-transcriptional regulation of Blimp! by let-7 (ref. 7). Func- 
tionally, ectopic Lin28 expression during EB differentiation resulted 
in significant increases in TNAP* EGC colony formation when com- 
pared to uninduced controls, indicating that enforced expression of 
Lin28 promotes expansion of PGC pools (Fig. 2e). 

To corroborate our in vitro results of Lin28 knockdown within a 
physiological setting in vivo, we injected knockdown ES cells into 
blastocysts and monitored the germ-cell contribution in the resulting 
chimaeric embryos via Stella—GFP transgenic reporter expression. 
Control embryos injected with Stella—GFP ES cells expressing a con- 
stitutive lacZ RNAi construct showed extensive contribution of 
GFP* germ cells in the genital ridge, whereas Blimp1 RNAi ablated 
germ-cell commitment (Fig. 3a, b). Lin28 knockdown compromised 
PGC formation to the same extent as Blimp1 knockdown. Failure of 
Lin28 RNAi ES cells to contribute to the germ line was not due to a 
generalized deficit in the potential to chimaerize tissues, as Lin28 
RNAi ES cells were detected in the tissues of the head and limbs in 
virtually all embryos tested (Supplementary Fig. 11). Stella-GFP 
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expression was not observed in somatic areas of chimaeric embryos, 
indicating the specificity of the Stella reporter’. Furthermore, these 
data suggest a cell-autonomous role for Lin28 in PGC formation, as 
chimaeric embryos possess mosaic tissues with presumably normal 
elements derived from the host that would be expected to 
complement deficits caused by Lin28 deficiency in non-germ cells. 
Thus, our data indicate a physiological role for Lin28 in PGC 
development in vivo. 

As a complementary approach to examine the impact of Lin28 on 
PGC numbers in vivo, we transduced Stella-GFP ES cells with a 
retrovirus that drives the ectopic expression of Lin28, injected blas- 
tocysts, and quantified ES cell contribution to the genital ridge. 
Ectopic Lin28 expression resulted in a significant enhancement of 
PGC number in chimaeric embryos (Fig. 3a, b), thus suggesting that 
Lin28 can both positively and negatively regulate the pool of germ 
cells. 

As the 3’ untranslated region (UTR) of Blimp! has been identified 
as a direct target for let-7 miRNAs’, we hypothesized that Lin28 
influences PGC development through let-7-mediated effects on 
Blimp1. Blimp1 is a key regulator of germ-cell commitment*’, and 
we sought to determine if ectopic expression of Blimp] lacking its 3’ 
UTR could rescue PGC development in the setting of Lin28 deple- 
tion. We engineered Stella—GFP ES cells possessing both Lin28 RNAi 
and ectopic Blimp1 expression and assayed the ability of these cells to 
contribute to the germ line in chimaeric E12.5 embryos. In this 
context, Blimp1 rescues PGC numbers to control levels (Fig. 3a, b). 
Additionally, Blimp1 RNAi ablates PGC development from ES cells 
ectopically expressing Lin28 (Fig. 3a, b). These data suggest that 
Lin28 is as an upstream regulator of PGC development relative to 
Blimp1, potentially acting via repression of the let-7 miRNA family 
within PGCs or their precursors. To test this hypothesis further, we 
engineered a doxycycline-inducible ES cell line to express a form of 
let-7 (let-7s-21L) that is resistant to Lin28 by virtue of a terminal loop 
substitution with that of miR-21 (a miRNA not regulated by Lin28)"*, 
and determined the effect of induction on Blimp! expression in EBs. 
We found that induction of let-7s-21L abrogates Blimp1 protein 
expression during EB differentiation (Fig. 3c); further supporting 
Lin28 regulation of Blimp! via let-7. 

Expression of Lin28b, a homologue of Lin28, is largely absent in 
early PGCs”°, but is expressed in ES cells. Thus, we also assessed the 
impact of Lin28b RNAi on PGC formation. We observed a compro- 
mised ability to form EB-derived PGCs owing to Lin28b knockdown 
and an intermediate phenotype in vivo (Supplementary Fig. 8). 
Knockdown of Lin28b in ES cells similarly results in upregulation 
of let-7 and compromises germ-cell formation, further linking let-7 
to PGC development but indicating that Lin28 is the dominant regu- 
lator in vivo. 

Given our data linking Lin28 and germ-cell numbers, we surmised 
that aberrant overexpression of Lin28 might dysregulate the growth or 
differentiation of pluripotent cells including germ cells. Furthermore, 
recent reports have linked Lin28 and its homologue Lin28b to onco- 
genesis'”*'. In teratomas formed from ES cells engineered to express 
Lin28 in response to doxycycline induction (iLin28 ES cells) we 
observed that primitive neural tissue predominated in regions of 
transgene induction (Supplementary Fig. 12a). High-grade, aggressive 
human teratocarcinomas contain abundant immature neural tissue, 
whereas low-grade, benign teratomas contain no immature neural 
tissue. This phenomenon is also observed in teratomas formed from 
inducible Oct4 ES cells’. In contrast, Lin28 RNAi during teratoma 
formation resulted in reduced tumour size, and these teratomas were 
comprised of mature tissues from all three germ layers (Supplemen- 
tary Fig. 12b, c). Lin28 and Lin28b are more prominently over- 
expressed in the transformed embryonal carcinoma (EC) cell line 
P19 relative to ES cells than corresponding Oc?4 levels in these cells 
(Supplementary Fig. 13a). Modulation of Lin28 expression by RNAi 
in P19 cells reduces both cell proliferation and tumour formation in 
immunodeficient mice, suggesting that Lin28 contributes to the 
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Figure 3 | Lin28 and Blimp1 regulate PGC development in vivo. 

a, Representative images of genital ridges from E12.5 embryos injected with 
Stella—GFP ES cells harbouring either lacZ RNAi, Blimp1 RNAi, ectopic 
Blimp1 expression, Lin28 RNAi, ectopic Lin28 expression, Lin28 RNAi plus 
ectopic Blimp1 expression, or ectopic Lin28 plus Blimp1 RNAi. Scale bar, 
100 um. b, Quantification by flow cytometry of the percentage of Stella* 
PGCs in E12.5 genital ridges dissected from chimaeric embryos created with 


malignant phenotype of embryonal carcinoma cells (Supplementary 
Fig. 13b-e). In addition, tumours with ectopic Lin28 displayed local 
invasion into the host tissues, whereas teratomas formed from plur- 
ipotent cells treated with Lin28 RNAi were differentiated, non-invas- 
ive and well encapsulated (Supplementary Fig. 13d, e), paralleling 
recent reports linking Lin28 to metastasis’. 

We sought to determine whether LIN28 is expressed in human 
germ-cell tumours. We interrogated a germ-cell tumour microarray 
data set and profiled multiple primary human tumours for the 
expression of LIN28, LIN28B and OCT4, a known marker of germ- 
cell tumours”’. We observed that malignant tumours of a germ-cell 
origin, including mixed germ-cell tumours, yolk-sac tumours, chor- 
iocarcinomas, embryonal carcinomas and seminomas, demonstrated 
overexpression of LIN28 and/or LIN28B (Fig. 4a). Importantly, 
benign teratomas and normal testis do not show high levels of 
LIN28 and LIN28B expression relative to malignant tumour types, 
and OCT4 is not expressed in yolk-sac tumours or choriocarcino- 
mas”. The expression of LIN28 and/or LIN28B in all malignant 


Mixed GCT 


YST C 


Stella-GFP ES cells containing the following: lacZ RNAi, Blimp1 RNAi, 
ectopic Blimp1 expression, Lin28 RNAi, ectopic Lin28 expression, ectopic 
Lin28 plus Blimp1 RNAi, or Lin28 RNAi plus ectopic Blimp1 expression. 
Black horizontal bars represent the median. NS, nonsignificant P values. 

c, Immunoblot for Blimp1 expression in extracts from ES cells and day 5 EBs 
with or without doxycycline induction of let-7s-21L. Gapdh served as a 
loading control. 


germ-cell tumour subtypes implicates LIN28 as a consistent marker 
of germ-cell tumours. We confirmed LIN28 protein expression in a 
primary human embryonal carcinoma by immunohistochemistry 
(Fig. 4b). Importantly, we noted that LIN28 expression was specif- 
ically observed in malignant components of the tumour, but not in 
benign elements of the tumour or in the adjacent stroma. 
Interestingly, regions of intratubular germ-cell neoplasia (a pre- 
cursor lesion for most germ-cell tumours) stained strongly positive 
for LIN28 whereas normal seminiferous tubules displayed little to no 
staining (Fig. 4b, right panel), suggesting that aberrant expression of 
LIN28 might be an early lesion in germ-cell tumorigenesis. In a 
separate study, we have documented that LIN28 overexpression facil- 
itates cellular transformation in NIH-3T3 and BaF3 cells, consistent 
with a functional role for LIN28 in malignancy”’. 

Lin28 selectively blocks the processing of the let-7 family of 
miRNAs’ *. let-7 miRNAs are upregulated only during later stages 
of PGC development”, establishing that let-7 expression is dynam- 
ically regulated within the germ lineage. Our data suggest that Lin28 
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Figure 4 | LIN28 expression in human germ-cell tumours. a, Primary 
malignant germ-cell tumours of multiple histologies show consistent high 
expression of LIN28 and LIN28B. Benign teratomas and normal testis show 
no significant expression of LIN28 or LIN28B. C, choriocarcinoma; EC, 
embryonal carcinoma; GCT, germ-cell tumour; S, seminoma; Ter, teratoma; 
Tes, normal testis; YST, yolk-sac tumour. b, Immunohistochemical 
detection of LIN28 in primary embryonal carcinoma. Left: cytoplasmic 
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immunoreactivity of embryonal carcinoma cells for LIN28 (between 
arrowheads). Endodermal epithelial elements of the teratoma (arrows) and 
tumoural stroma are not immunoreactive. Middle: higher magnification of 
immunoreactive embryonal carcinoma. Adjacent seminiferous tubules are 
negative (between arrows). Right: intratubular germ-cell neoplasia within 
the embryonal carcinoma demonstrates LIN28 immunoreactivity. x40 
magnification. 
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suppression of let-7 maturation is required for proper Blimp1- 
mediated development of the germ lineage. Consistent with this 
hypothesis, we have observed that Lin28 RNAi is associated with 
upregulation of let-7 miRNAs and the abrogation of Blimp1 express- 
ion in Stella* cells during in vitro differentiation (Supplementary Fig. 
9c, z, a’-e'). Additionally, ectopic Blimp1 expression rescues PGC 
formation from ES cells harbouring Lin28 knockdown. Our Lin28 
expression data in PGCs correlate inversely with the expression of 
let-7 miRNAs” and are coincident with Blimp! protein dynamics in 
PGCs”*. Although Blimp1 seems critical, additional let-7 targets may 
yet prove relevant to PGC specification. 

Our functional data indicate that Lin28 is a key lineage regulator 
for germ-cell numbers (Supplementary Fig. 14). As multiple lineage- 
specifying factors such as MITF (microphthalmia-associated tran- 
scription factor)’” and SCL (stem cell leukaemia gene)** have been 
implicated as oncogenes, Lin28 could have a dual role in both germ- 
cell development and malignancy. As let-7 miRNAs function as 
tumour suppressors by directly and indirectly repressing a number 
of cell proliferation pathways, Lin28 de-repression of let-7 oncogenes 
has been established as a key oncogenic factor’?*'. Multiple tumour 
types show reduced levels of let-7 miRNAs, and several let-7 targets, 
including c-Myc, k-Ras and HMGA2, are oncogenic”’”’. In addition, 
somatic cell reprogramming factors (including presumably Lin28) 
also have prominent roles in cellular transformation. Oct4 has been 
identified as an oncogenic driver for certain germ-cell tumours, 
including embryonal carcinomas and seminomas but not yolk-sac 
tumours and choriocarcinomas***. Here we show that Lin28 and its 
homologue Lin28b are more consistently expressed in malignant 
germ-cell tumours than even Oct4, including within yolk-sac 
tumours and choriocarcinomas. Together, our data implicate 
Lin28 as a novel regulator of germ-cell development and a consistent 
marker of germ-cell malignancy, and provide new insights into germ- 
cell formation, development and cancer. 


METHODS SUMMARY 


shRNA constructs in the pLKO.1-Puro vector system for RNAi-mediated gene 
knockdown were obtained from Sigma and Open Biosystems (MISSION TRC- 
Mm 1.0). TRC numbers for individual hairpins as well as primer information for 
real-time PCR analysis can be found in Supplementary Tables 1 and 2. EB- 
derived PGC microarray analysis used Affymetrix mouse 430 2.0 whole genome 
arrays, and data analysis used MASS5.0 standard normalization and differential 
expression analysis techniques using GeneSpring GX 7.3 software. Embryo- 
derived PGC and PGC-precursor microarray data sets were published previ- 
ously’? and have the Gene Expression Omnibus (GEO) accession number 
GSE11128. Primary human tissue microarray data sets were published previ- 
ously” and have the GEO accession number GSE3218. R. Yu and C. Estrada 
provided the pCDH-CMV-mPrdm1 construct. Antibodies used in this study 
include: mouse anti-SSEA1 (MC480; Developmental Studies Hybridoma Bank 
at the University of Iowa), rabbit anti-GFP (BioVision), goat anti-Lin28 for 
immunofluorescence and western blotting (Santa Cruz), anti-LIN28 for human 
germ-cell tumour immunohistochemistry (Proteintech Group), anti-Blimp1 
(R&D Systems), and Alexa-594 phalloidin (Molecular Probes). In Fig. 3b, P 
values were calculated by the Mann-Whitney rank-sum test and are relative to 
the control, lacZ RNAi. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

ES cell culture, EB differentiation, PGC selection and EGC culture. Transgenic 
Stella—GFP and Stella BAC ES cells (B6/CBA) were cultured as described, and the 
hanging drop method was used for ES cell differentiation as EBs”'**°?. All data 
presented used the Stella—GFP transgenic ES cell line unless indicated’. Inducible 
Lin28 ES cells (iLin28 ES cells) and inducible let-7s-21L ES cells were created 
from the KH2 ES cell line as described!***, and 2 ug ml ' doxycycline was added 
for induced expression during differentiation. 

For PGC selection, EBs were dissociated and the single-cell suspension was 
subjected to FACS using a BD FACS Aria cell sorting system to purify Stella— 
GFP-positive cells. For the iLin28 ES cell line, SSEA1 purification was used for 
germ-cell isolation, as described'***. For EGC culture, FACS-processed cells were 
plated on MEFs, with mouse ES cell media containing 15ngml~! of bFGF 
(Invitrogen) and 30 ng ml! of SCF (Peprotech), plus 2 UM trans-retinoic acid 
(Sigma). Retinoic acid promotes germ-cell self-renewal while serving as a 
differentiation agent for ES cells**. After 5-7 days, replacing the media daily, 
cultures were fixed and stained for tissue-nonspecific alkaline phosphatase 
(TNAP), a germ-cell-specific marker*’, or clonal EGC colonies were isolated 
and cultured as previously described and expanded for analysis”. For TNAP 
staining, cultured cells were washed once with PBS, fixed in 4% paraformalde- 
hyde (PFA) for 1 min, and then washed in PBS three times. AB staining solution 
(10 ml buffer (100 mM Tris HCl pH 9.5, 50 mM MgCh, 100 mM NaCland 0.1% 
Tween-20), 22.5 pl of 50mgml ' nitro-blue tetrazolium in 70% dimethylfor- 
mamide (NBT; Promega), and 17.5 pl of 50mg ml! 5-bromo-4-chloro-3- 
indoyl-1-phosphate in 100% dimethylformamide (BCIP; Promega)) was added 
to the cells and incubated at room temperature for 5-10 min, and then the cells 
were washed in PBS several times and imaged. 

Confocal imaging. Embryos or EBs were fixed in 4% PFA (in PBS) for 1h at 
room temperature, washed 3X in PBS (5 min per wash) and permeabilized for 
1h at room temperature in 1X IF buffer (PBS plus 0.1% BSA, 0.2% Triton 
X-100, 0.04% Tween-20). They were then incubated with 1:200 primary rabbit 
anti-GFP (BioVision) or 1:200 primary goat anti-Lin28 (Santa Cruz) in IF buffer 
at 37 °C for 1h, washed 3 X with IF buffer 5 min each at room temperature with a 
final wash in IF buffer at 37 °C for 10 min. They were then incubated with 1:500 
secondary (Alexa488 and/or Alexa594, Molecular Probes) or 1:200 Alexa-594 
phalloidin (Molecular Probes) and DAPI (1:100 from 5 mg ml! stock) in IF 
buffer for 1 h at 37 °C. Samples were placed in coverslip-bottom dishes (MatTek) 
in Fluoromount G mounting medium (Southern Biotech) and imaged ona Leica 
confocal microscope (X63 objective with oil). 

Quantitative real-time RT-PCR. Quantitative real-time PCR of gene expression 
used SYBR green. Multiple, independent biological samples were obtained for 
RNA isolation. Levels of mature miRNAs were measured using Taqman probes 
(Applied Biosystems). B-actin expression was used to normalize all samples 
except for miRNA expression, which were normalized to snol42 RNA. 
Relative fold changes were calculated using the AAC, method. See 
Supplementary Table 1 for primer sequences. 

Microarray hybridization and data analysis. Each sample contained two to four 
biological replicates. RNA was isolated using an RNeasy kit (Qiagen) according 
to the manufacturer’s protocol. Biotin-labelled RNA was hybridized to 
Affymetrix mouse 430 2.0 whole genome arrays at the Dana-Farber Cancer 
Institute Microarray Core. GeneSpring GX 7.3 software was used for all data 
analysis, including scatter plot generation, using MAS5.0 standard normaliza- 
tion and differential expression analysis techniques. Principal component ana- 
lysis (PCA) was performed via GeneSpring using mean centring and scaling for 
all genes. Raw data for EB-derived PGCs and associated cell types are deposited 
in the public gene expression database GEO (http://www.ncbi.nlm.nih.gov/geo/) 
under the accession number GSE7948. Embryo-derived PGC and PGC-precursor 
microarray data sets were published previously’? and have the GEO accession 
number GSE11128. 

Southern blot analysis. Imprint analysis for the Igf2r locus was at the maternally 
methylated differentially methylated region 1 (DMR1). Genomic (g) DNA was 
digested with PvulI/Mlul and probed via Southern blot hybridization as 
described’*. For the paternally methylated Rasgrf1 locus, Southern blot hybridi- 
zation of gDNA digested with PstI/Notl was probed as described*’. 

Bisulphite sequencing. Bisulphite treatment of gDNA was carried out using a 
Chemicon CpGenome DNA Modification Kit according to the manufacturer’s 
protocol. Sample treatment and processing were performed simultaneously for 
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quality control. Nested PCR was then used to amplify KvDMRI”* and the Snrpn 
DMRIas described. The KyDMRI1 region examined is on distal chromosome 7, 
contains 33 CpGs within intron 10 of the Keng gene, and controls the expression 
of six paternally repressed genes: Ascl2, Tssc4, Keng, Cdknic, Slc22a18 and 
Tssc3°**. The maternally repressed Snrpn DMR1 examined is located on central 
chromosome 7 and contains 11 CpGs*’. PCR products were cloned from two 
independent PCR reactions. Bisulphite conversion efficiency ranged from 85% 
to 99%, and 83% to 100% of individual clones for each sample showed unique 
patterns of conversion. 

RNA interference. shRNA constructs in the pLKO.1-Puro vector system for 
RNAi-mediated gene knockdown were obtained from Sigma and Open 
Biosystems (MISSION TRC-Mm 1.0). Expression of the shRNA was constitutively 
driven by the human U6 promoter whereas the human phosphoglycerate kinase 
promoter (PGK) independently drove the expression of a puromycin resistance 
gene to allow selection for cells harbouring the shRNA-mediated gene knockdown. 
TRC numbers for hairpins used are listed in Supplementary Table 2. For Lin28, 
only shRNA TRCN0000102579 was used for all experiments, unless otherwise 
noted. Viral production in 293T cells and ES cell infection were performed accord- 
ing to product literature and as described’. Puromycin selection was applied 
continuously during all subsequent cell culture including EB differentiation. 
Ectopic gene expression vectors. Ectopic Lin28 was provided by either pBABE- 
puro-Lin28 or MSCV-neo-Lin28, whereas ectopic Blimp1 was driven by pCDH- 
CMV-mPrdm1 (System Biosystems; provided by R. Yu and C. Estrada). 
Tumour formation. All tumour-forming assays, including teratoma formation, 
were performed using NOD-SCID mice (Charles River Laboratories). 1 X 10° 
cells in 150 pl PBS were injected subcutaneously for all experiments. 
Production of chimaeric embryos. C57BL6 females were superovulated by 
PMSG followed 48h later by hCG and were mated with C57BL6 males. 
Blastocysts were collected from the uterus 3.5 days after vaginal plugging. 12— 
15 ES cells were injected into each blastocyst which was then transferred to the 
uterus of a 2.5-day postcoitum pseudo-pregnant CD-1 female. Ectopic Lin28 
expression was accomplished by retroviral infection of Stella—-GFP ES cells with 
pBABE-Lin28. Statistical significance of germ-cell chimaerism was determined 
by the Mann-Whitney rank-sum test. 

Human primary tumour and cell line information. Primary tissue microarray 
data sets were published previously’? and have the GEO accession number 
GSE3218. Log, transformed expression data were normalized to the average 
expression signal for normal tissue. Cutoff for overexpression was as a normalized 
expression value of 2 or greater. Immunohistochemistry of paraffin-embedded 
tissues was performed using standard techniques with LIN28 primary antibody 
staining (1/200; Proteintech Group) followed by streptavidin peroxidase incuba- 
tion (Lab Vision) and DAB kit colour developing (Vector Laboratories) according 
to the manufacturers’ protocol. Sections were counterstained with haematoxylin. 
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Multiple roles for MRE11 at uncapped telomeres 


Yibin Deng'*}, Xiaolan Guo'”*+, David O. Ferguson* & Sandy Chang'” 


Progressive telomere attrition or uncapping of the shelterin com- 
plex elicits a DNA damage response as a result of a cell’s inability to 
distinguish dysfunctional telomeric ends from DNA double- 
strand breaks’. Telomere deprotection activates both ataxia telan- 
giectasia mutated (ATM) and telangiectasia and Rad3-related 
(ATR) kinase-dependent DNA damage response pathways, and 
promotes efficient non-homologous end-joining (NHEJ) of dys- 
functional telomeres”. The mammalian MRE11-RAD50-NBS1 
(MRN; NBS1 is also known as NBN) complex interacts with 
ATM to sense chromosomal double-strand breaks and coordinate 
global DNA damage responses®’. Although the MRN complex 
accumulates at dysfunctional telomeres, it is not known whether 
mammalian MRN promotes repair at these sites. Here we address 
this question by using mouse alleles that either inactivate the 
entire MRN complex or eliminate only the nuclease activities of 
MRE11 (ref. 8). We show that cells lacking MRN do not activate 
ATM when telomeric repeat binding factor 2 (TRF2) is removed 
from telomeres, and ligase 4 (LIG4)-dependent chromosome end- 
to-end fusions are markedly reduced. Residual chromatid fusions 
involve only telomeres generated by leading strand synthesis. 
Notably, although cells deficient for MRE11 nuclease activity effi- 
ciently activate ATM and recruit 53BP1 (also known as TP53BP1) 
to deprotected telomeres, the 3’ telomeric overhang persists to 
prevent NHEJ-mediated chromosomal fusions. Removal of 
shelterin proteins that protect the 3’ overhang in the setting of 
MRE11 nuclease deficiency restores LIG4-dependent chromosome 
fusions. Our data indicate a critical role for the MRN complex in 
sensing dysfunctional telomeres, and show that in the absence of 
TRE2, MRE11 nuclease activity removes the 3’ telomeric overhang 
to promote chromosome fusions. MRE11 can also protect newly 
replicated leading strand telomeres from NHEJ by promoting 5’ 
strand resection to generate POT1a—TPP1-bound 3’ overhangs. 
The proper maintenance of telomeres is essential for global genome 
stability. Mammalian telomeres are composed of TTAGGG repeats 
bound to shelterin—a complex of six core proteins including the 
double-stranded DNA-binding proteins TRF1 and TRF2, and protec- 
tion of telomeres 1 (POT1) that interacts with its binding partner TPP1 
to protect single-stranded G-rich overhangs'. Telomeres rendered dys- 
functional by the removal of TRF2 are recognized as double-strand 
breaks (DSBs), activate ATM and are ligated by the NHEJ pathway to 
generate fused chromosomes’. The MRN complex recruits ATM to 
sites of DSBs, where it initiates a signalling cascade leading to check- 
point responses”'®. Although MRN localizes to deprotected telo- 
meres''!”, its role in NHEJ-mediated repair is not well understood. 
We sought to uncover the roles of the MRN complex in NHEJ by using 
two mouse alleles of MRE11: the Mrel1~’ null allele that also 
abolishes the formation of the MRN complex, and the Mre11""!??/“ 
nuclease-deficient allele—in which His 129 was mutated to Asn—that 
eliminates both endo- and exonuclease functions of MRE11 (ref. 8). 


To engage the NHEJ pathway, we removed TRF2 from telomeres with 
retrovirus-mediated short hairpin RNA (shRNA) to Trf2 in SV40LT 
immortalized mouse embryonic fibroblasts (MEFs) (Supplementary 
Fig. la). Metaphase spreads collected 72 to 120h after Trf2 shRNA 
treatment showed a progressive increase in the number of fused telo- 
meres until nearly all chromosomes are joined end-to-end 
(Supplementary Fig. 1b). This telomere fusion phenotype was sup- 
pressed when MEFs were first complemented with shRNA-resistant 
murine T7f2 complementary DNA before Trf2 shRNA treatment 
(Supplementary Fig. 1c, d and Supplementary Table 1), indicating that 
the telomere fusions observed are not due to off-target effects of T7f2 
shRNA treatment. 

To determine whether MRE11 is required for the DNA damage 
response (DDR) initiated by dysfunctional telomeres, MEFs contain- 
ing the conditional Mre11‘” allele were infected with adenovirus 
expressing Cre recombinase. The floxed Mre11 allele was efficiently 
deleted, with concomitant loss of MRE11 protein as confirmed by 
western blotting (Supplementary Fig. 2b, c). Because TRF2 represses 
ATM activation at telomeres*’, removal of TRE2 from Mre11'" cells 
resulted in robust ATM activation, phosphorylation of CHK2 and 
induction of phosphorylated histone 2AX (y-H2AX, also known as 
H2AFX) and 53BP1 telomere-induced DNA damage foci (TIF) that 
colocalized with telomeric DNA in 90% of cells examined (Fig. la, c 
and Supplementary Fig. 3a). However, both ATM phosphorylation 
and TIF formation were reduced to 12% in Mre11“/“ MEFs in res- 
ponse to TRF2 depletion (Fig. 1a, c and Supplementary Fig. 3a). To 
determine whether the MRE11 nuclease activity is required for ATM 
activation after induction of telomere dysfunction, Mre11'"!??N/4 
MEFs were generated (Supplementary Fig. 2a—c). In contrast to 
Mrel1™/“ MEFs, the phosphorylation of ATM and induction of 
TIF after TRF2 depletion in Mre11'"!?°N/ MEFs remained robust 
(Fig. 1b, d and Supplementary Fig. 3b). These results indicate that 
ATM phosphorylation in response to TRF2 depletion requires MRN, 
whereas the MRE11 nuclease activity is dispensable for this function. 
At murine telomeres, ATR activation is repressed by POT 1a*”. To test 
whether ATR activation at telomeres requires MRN or MRE11 nucle- 
ase activity, Potla shRNA was used to deplete POT1a in Mre11“/* 
and Mre11'"!?°\’4 MEFs. Removal of POT 1a resulted in robust ATR 
phosphorylation and TIF formation in both cell types (Fig. le, f and 
Supplementary Fig. 3c, d), suggesting that MRE11 is not required to 
activate an ATR-dependent DDR at telomeres after POT 1a loss. 

Our data demonstrate that the removal of TRF2 results in telo- 
meres being recognized as DSBs that elicit MRN-dependent DDR. To 
determine whether the repair of dysfunctional telomeres is impacted 
in the setting of MRE11 nuclease deficiency or after removal of 
the MRN complex, TRF2 was depleted in Mre11"*, Mre11“/4 and 
Mre11™!?°N'4 MEFs. Although Mrel1" cells showed extensive 
end-to-end chromosome fusions in a LIG4-dependent manner, 
Mrel11“* cells showed a 15-fold reduction in the number of 
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Figure 1| Activation of ATM but not ATR after TRF2 depletion requires the 
MRN complex but not MRE11 nuclease activity. a, b, y-H2AX-positive TIF 
in MEFs of the indicated genotypes after TRF2 depletion in MRE1 1-deficient 
SV40LT immortalized MEFs (a) and MRE11-nuclease-activity-deficient 
MEFs (b). Original magnification, < 630. ¢, d, Western blots detecting TRF2, 
MRE1I1, p-ATM, total and phosphorylated CHK2 (p-) after Trf2 shRNA 
treatment in MRE11-deficient SV40LT immortalized MEFs (¢) and MRE11 
nuclease-deficient MEFs (d). Tubulin served as loading control. 

e, f, Immunoblots for MRE11 and ATR phosphorylation, and RT-PCR to 
detect Potla and Gapdh transcripts, after Potla shRNA infection of MRE11- 
deficient SV40LT immortalized MEFs (e) and MRE11 nuclease-activity- 
deficient MEFs (f). Nsp, nonspecific protein used as a loading control. 


chromosome-chromosome fusions (Fig. 2a, Supplementary Fig. 4a— 
c and Supplementary Table 2). Unexpectedly, chromosome-chro- 
mosome fusions were similarly reduced in nuclease-deficient 
Mre11"!7°" cells, even though the DDR is activated after TRF2 loss 
(Figs 1d, 2a and Supplementary Table 2). These results indicate a 
critical role of the MRE11 nuclease activity in mediating NHEJ of 
dysfunctional telomeres. 

The marked reduction in the number of chromosome—chromosome 
fusions observed in Mre11'"?°“’ cells treated with Trf2 shRNA 
suggests that nucleolytic processing of telomeric ends might be 
required for NHEJ of dysfunctional telomeres. To test this hypothesis, 
we examined the status of the 3’ single-strand overhang using an in-gel 
hybridization assay. In contrast to Trf2-shRNA-treated Mre11'"“ cells, 
which showed rapid reduction of the 3’ overhang due to NHEJ- 
mediated processing’, the 3’ overhang persists in MrelI““ and 
Mre11'"!°™ cells with uncapped telomeres (Fig. 2b). Treatment with 
the 3’ end specific exonuclease Exol revealed that these overhangs are 
indeed single-stranded telomeric repeats (Supplementary Fig. 5a, b). 
The lack of overhang degradation in Mre11"!?°™ cells treated with 
Trf2 shRNA suggests that nucleolytic processing of the 3’ telomeric 
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Figure 2 | Requirement for MRN and MRE11 nuclease activity in NHEJ of 
telomeres lacking TRF2. a, MEFs of the indicated genotypes were treated 
with control vector or Trf2 shRNA for 120 h, metaphases were prepared and 
telomere fusions were visualized by telomere peptide nucleic acid (PNA)- 
FISH (red) and 4,6-diamidino-2-phenylindole (DAPI; blue). b, In-gel 
hybridization assay using a clamped homogenous electric field (CHEF) gel 
to fractionate genomic DNA, then hybridized in situ to a (CCCTAA), probe 
to detect the 3’ single strand (SS) overhang under native conditions (left) 
and under denatured condition (right) to detect total TTAGGG repeats 
(right). c, Schematic of fusion products expected in 
chromosome-chromosome, leading-leading chromatid and 
lagging—lagging chromatid fusions. d, Fusion products of Trf2-shRNA- 
treated cells of the indicated genotypes analysed by CO-FISH. DNA was 
detected by DAPI (blue). Telomeric probes used in CO-FISH experiments 
are FITC-OO-(TTAGGG),, to label the leading strand, and Tam-OO- 
(CCCTAA), to label the lagging strand. Representative images are shown. 
e, Quantification of the types of fusions observed in cells of the indicated 
genotypes expressed as a percentage of total fusions observed after TRF2 
depletion. A minimum of 150 independent fusion events with telomeric 
signals at the sites of fusions were characterized per genotype. Original 
magnification for all images, < 630. 


915 


©2009 Macmillan Publishers Limited. All rights reserved 


LETTERS 


overhang by MRE11 nuclease activity is required for efficient NHEJ of 
telomeres rendered dysfunctional by the removal of TRF2. 

Interestingly, telomere fusions were not completely abolished in 
Trf2-shRNA-treated Mre11 cells. Instead, chromosome-orientation 
fluorescent in situ hybridization (CO-FISH) showed that ~90% of 
these rare fusions involved the leading strands of sister chromatids 
(Fig. 2c—e). After telomeric replication, telomeres formed by leading 
strand synthesis are initially blunt, whereas lagging strand DNA syn- 
thesis results in the formation of a 3’ overhang after the removal of the 
last synthesis primer. Leading strand telomeric ends have to be enzy- 
matically processed after replication to produce a 3’ overhang that is 
protective against NHEJ-mediated fusion. We postulate that the MRN 
complex prevents NHEJ of newly synthesized leading strand telomeres 
by mediating 5’ end resection of the leading strand to generate a 3’ 
overhang. In support of this notion, leading—leading chromatid 
fusions comprised ~60% of all chromatid fusions in Trf2-shRNA- 
treated Mrel1™?°" cells, suggesting that the nuclease activity of 
MRE11 is at least in part responsible for the formation of the 3’ over- 
hang at the leading strand (Fig. 2d, e). 

Although the POT1a-TPP1 shelterin sub-complex™ binds to the 3’ 
single-stranded telomeric overhang and represses ATR-mediated 
DDR?*»”, little is known about how the overhang is protected from 
engaging DNA repair pathway(s). The presence of an intact single- 
stranded telomeric overhang in Trf2-shRNA-treated Mre114/11°N 
cells, and the ability of this mutant to activate ATM and ATR signal- 
ling pathways, represent a unique opportunity to address this ques- 
tion. Using immunostaining and telomere FISH to detect the 
localization of endogenous TPP1 at single-stranded overhangs and 
the double-stranded telomeric binding protein TRF1 to mark telo- 
meres, we observed a 3.5-fold reduction of TPP1, but not TRF1, at 
telomeres in Mre1 1"!7?”" cells after TRF2 depletion (Supplementary 
Fig. 6a—c), reinforcing the idea that telomeric accumulation of the 
POTla-TPP1 complex is reduced when TRE2 is depleted’*. In con- 
trast, TPP1 immunostaining remained robust in Trf2-shRNA-treated 
Mre11""!?°\' cells (Supplementary Fig. 6a-c), suggesting that 
POTla-TPP1 remains complexed to the single-stranded telomeric 
overhangs in these cells. 

We next addressed whether the removal of POT1a—TPP1 rendered 
the overhang amenable to DSB repair. We first depleted TRF2 in 
Mre1 147°" MEFs, then removed POT1la, POT1b or TPP1 from 
the single-stranded overhang using shRNAs (Supplementary Fig. 
7a). Depletion of POT1la but not POTIb resulted in a threefold 
increase in telomere end-to-end fusions (Fig. 3a, b), supporting the 
hypothesis that in mouse cells, POT1a helps protect telomeres from 
engaging in inappropriate repair, whereas POT1b is involved in the 
protection of the Cstrand'*'’. Removal of TPP1 (ref. 4) from the 3’ 
overhang of Trf2-shRNA-treated Mre11"!2°"’4 cells resulted in a five- 
fold increase in chromosome end-to-end fusions, with 30% of all 
chromosomes fused (Fig. 3a, b and Supplementary Figs 7a and 8a, 
b). Taken together, these results demonstrate that at telomeres lacking 
TRE2 and MREI1 nuclease activities, the POT1la-TPP1 complex is 
able to protect telomeres from participating in inappropriate repair 
pathways. 

ATM, ATR and 53BP1 have been implicated in mediating NHEJ of 
dysfunctional telomeres after the removal of TRF2 (refs 4, 5, 20). We 
found that the depletion of ATR, but not of ATM, resulted in a 
fourfold reduction in the number of end-to-end chromosomal 
fusions induced by retroviral expression of full-length Tpp1"? (a 
TPP1 mutant in which the POT 1a recruitment domain is deleted; ref. 
4) in Trf2-shRNA-treated Mrel1™!?°'’* cells (Fig. 3c, d and 
Supplementary Figs 7b and 9a—c). Similarly, depletion of 53BP1 led 
toa fourfold reduction of chromosomes fused in Mre1 1'"!??/“ MEFs 
expressing TppI**? (Fig. 3c, d and Supplementary Fig. 7c). 
Collectively, these data argue that in the absence of the POT1la— 
TPP1 complex, processing of the single-stranded overhang does 
not require MRE11 nuclease activity to mediate ATR and 53BP1- 
dependent chromosomal fusions. We postulate that other nucleases 
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Figure 3 | POT1a-TPP1 complex protects single-stranded overhangs from 
DNA repair. a, Depletion of POT1a or TPP1, but not POT1b, results in 
pronounced telomere fusions in metaphase spreads from Trf2-shRNA- 
treated Mre11"!2°N’4 MEFs, with telomeric PNA-FISH (red) and DAPI 
(blue). b, Quantification of telomere fusions from representative images 
shown in a. Error bars, s.d.; n = 350, asterisk, P< 0.001. ¢, Depletion of ATR 
or 53BP1 inhibits chromosomal fusions generated by expression of Tpp14”” 
cDNA and removal of TRE2 in Mre117!7°\’4 MEFs. d, Quantification of 
telomere fusions from representative images shown in c. Error bars, s.d.; 
n= 900, asterisk, P< 0.001. Original magnification for all images, < 630. 


in the cell, for example the nucleotide excision repair nuclease 
ERCC1 (also known as XPF) that has been shown to remove the 3’ 
overhang when coupled to the NHEJ machinery’’, might substitute 
for MRE11 nuclease function at this stage. 

Furthermore, we asked which pathway (NHEJ versus homologous 
recombination) is required for repair of the single-stranded over- 
hang. Removal of POT1a-TPP1 from Mre11''”"\’* MEFs resulted 
in increased homologous recombination at telomeres, detected as a 
fivefold increase in telomere sister chromatid exchanges (T-SCEs)*! 
(Supplementary Fig. 10a, b). Notably, T-SCEs were not increased 
significantly in POTla-TPP1-depleted Mre11‘"!?\* cells, suggest- 
ing that the MRE11 nuclease activity has an important role in telo- 
meric homologous recombination (Supplementary Fig. 10a-c). 
When TRF2 is removed from POT1a—TPP1-depleted Mre11/"!7°N/4 
cells, robust LIG4-dependent chromosomal fusions were observed 
(Fig. 4a—c). However, T-SCE levels in these cells remained similar to 
those observed in control cells (Fig. 4d and Supplementary Fig. 11). 
These results support the hypothesis that the main function of 
POT 1la-TPP1 is to repress homologous recombination at telomeres, 
whereas TRF2 is primarily responsible to repress NHEJ at telomeres. 

The data presented here reveal several roles for the MRN complex at 
telomeres. MRN interacts with TRF2 and localizes to telomeres 
throughout the cell cycle'’. Together with TRF2, MRE11 has a pro- 
tective role in preventing NHEJ-mediated fusion of leading strand 
telomeric DNA after replication by promoting 5’ leading strand resec- 
tion to generate POT1a—TPP1 bound 3’ overhangs that prevent NHEJ 
(Supplementary Fig. 12). Although the MRE11 nuclease activity has a 
role in this process, additional factors are likely to be involved**. One 
such candidate is CtIP (also known as RBBP8)**™, because recent data 
indicate that the yeast homologue SAE2 (also known as UBA2) 


©2009 Macmillan Publishers Limited. All rights reserved 


NATURE|Vol 460|13 August 2009 


Lig4 shRNA 


Lig4 shRNA 


cC 
a) 
& 
Mre114/H129N 
Tr2 shRNA Tpp 1492 


TTAGGG CCCTAA 


(Vector 


g a Lig4 shRNA $ 
2 
fo} n 
® 30-7 t 
° dS 
iS = 
° = Vector 
= 204 s 
= = 
a 404] | Z = 
no 

c © 
Ss Ss 
=) 

rm 


Piel 7 A/H129N 


Trf2 shRNA Tpp 142 


Figure 4 | LIG4 is required for repair of unprotected single-stranded 
telomeric overhangs. a, Immunoblot demonstrating that Lig4 shRNA 
efficiently depletes LIG4 from Mre11'"!7°""“ nuclease-deficient MEFs 
expressing Tpp1°"? and Trf2 shRNA. Nsp, nonspecific protein used as 
loading control. b, LIG4 is required for chromosomal fusions in 
Mre11'"!?°N' nuclease-deficient MEFs expressing Tpp14®” and Trf2 
shRNA. ¢, Quantification of the number of fusions per chromosome from 
representative images shown in b. Error bars, s.d.; n = 850, asterisk, 
P<0.001. d, T-SCEs in Mre11 HI29NIA 1 clease-deficient MEFs depleted in 
LIG4, TRE2 and expressing TPP14®”. CO-FISH was performed using FITC- 
OO-(TTAGGG), (green) and Tam-OO-(CCCTAA), (red) probes. Original 
magnification for all images, < 630. 


cooperates with MRE11 to remove nucleotides at DSBs in a 5’-3' 
endonucleolytic manner before extensive resection is performed by 
other nucleases, including ExoI and Dna2 (refs 25, 26). However, the 
generation of 3’ single-stranded overhangs has the undesired con- 
sequence of initiating inappropriate homologous recombination at 
telomeres. Indeed, the MRE11 nuclease activity is required for telo- 
mere homologous recombination after POTla—-TPP1 is removed 
from the 3’ overhang. Because initiation of homologous recombina- 
tion requires RPA binding to single-stranded DNA, we speculate that 
the POT1la—TPP1 complex represses homologous recombination by 
preventing RPA access to the 3’ overhang. 

Removal of TRF2 from telomeres stimulates the MRN complex to 
activate ATM and 53BP1 to promote NHEJ. Surprisingly, removal of 
TRF2 also liberates the MRE11 3’—5’ nuclease activity to process 3’ 
telomeric overhangs before NHEJ of chromosome ends. Because 
single-stranded telomeric overhangs are incompatible with DNA 
ligation’, we speculate that degradation of the 3’ overhang by 
MRE11 (ref. 27), probably in conjunction with other components 
of the NHEJ pathway'*”, is required to generate telomeric substrates 
amenable for joining by LIG4. This model is supported by the obser- 
vation that MRE11 nuclease activities are required for efficient NHEJ 
during class-switch recombination in developing lymphocytes”. 

Our studies also uncover an unexpected role for the POT1la—TPP1 
complex in protecting the 3’ overhang from NHE] in the absence of 
TRF2 and MRE11 nuclease activity. Because TRF2 is thought to be 
essential for the formation of the telomeric-loop structure proposed to 
protect chromosome ends from engaging the NHEJ pathway, our 
result support the notion that under certain conditions the telomeric 
loop is dispensable for telomere end protection”’. We speculate that the 
POT 1la-TPP1 complex cooperates with TRF2—RAP1 during telomere 
replication, when the telomeric loop is transiently lost, to protect the 
integrity of linear chromosomal ends”. 
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METHODS SUMMARY 

Generation of MEFs and conditional deletion of MRE11. MEFs were isolated 
from embryonic day (E)13.5 embryos obtained from crosses between Mre11'/“ 
and Mre11'/1112°N mice, and grown in standard culture conditions. PCR-based 
genotyping was performed as described*. Primary MEFs were immortalized at 
passage 2 by transfection with pBabeSV40LT. To delete MRE11, MEFs with 
different genotypes were treated with Cre-expressing adenoviruses at multipli- 
cities of infection of 500. Depletion of MRE11 was confirmed by PCR and 
western blot. 

shRNA interference. Two T7f2 shRNAs were generated in pSuper. To generate 
Retro-pSuper constructs, EcoR1- and Xhol-digested insert from pSuper was 
subcloned into the same site into Retro-pSuper vector. The shRNA target 
sequences for mouse Trf2 are: shRNAI1, 5’-CTGTCATTATTTGTATCAA-3’; 
and shRNA2, 5'-GAACAGCTGTGATGATTAA-3’. The target sequence of Trf2 
shRNA2 was changed to 5’-CGTACTGCAGTCATGATC-3’ by standard site- 
directed mutagenesis to create Trf2-shRNA-resistant construct Retro-pSuper 
shTRF2-2M (Stratagene). Lentivirus-based shRNAs for Lig4, Mrel1 and 53BP1 
were purchased from Sigma (sequences available on request). Published target 
sequences” were constructed into Retro-pSuper and used for knockdown of ATR. 
shRNAs for Potla, Potlb, Tpp1 and the TPP1“®? construct Tpp14*? were as 
described". Retrovirus or lentivirus-mediated efficient knockdown of target genes 
was verified by either RT-PCR or immunoblotting. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mouse genotyping. Mre1 1 genotyping was performed as described’. The follow- 
ing primers were used: forward, 5’-TACAAAAGGTTGAAAATTTGAGAAGC-3’, 
reverse, 5'-TGTAATTGCAGGT-CCTTAAAGGC-3’. The thermocycling condi- 
tions were 36 cycles of 95 °C for 30s, 52°C for 1 min, and 72 °C for 1 min. 
RNA isolation, RT-PCR. RNA was isolated from approximately 10° cells with 
the Qiagen RNAeasy kit. RT-PCR was performed with the oligo-dT RT-PCR 
system according to the protocol provided by the manufacturer (Invitrogen). 
The following primers were used: mouse Potla forward, 5'-GATGACGTCA 
CAGGCGCCTAGG-3’; reverse, 5’-TCCCATACACACTGCACTCAATGG-3’; 
mouse Potlb forward, 5'-CTTTAAGCCTCCGGCCTTAAGCAAAGG-3’; 
reverse, 5'-CITTGGACATGATTATCAGCAACGACAATGTCTAC-3’; mouse 
Tpp1 forward, 5’-ATGTCCGATTCAGGGTTGCTGG-3’; reverse, 5’-TCATAC 
CTGGGTTAACTCAGACTCTGACTC-3’; mouse Gapdh forward, 5'-TCACCA 
CCATGGAGAAGGC-3’; reverse, 5’-GCTAAGCAGTTGGTGGTGCA-3’. 
Immunoblotting. Cell extracts were isolated and western blot was performed as 
described*. Antibodies were: phospho-ATM Ser 1981 from Rockland; phospho- 
ATR, ATRand MRE11 from Cell Signaling; CHK2 from BD Biosciences; y-tubulin 
and haemagglutinin from Sigma; y-H2AX from Upstate; anti-53BP 1 antibody was 
obtained from P. Carpenter at UT Medical School; anti-TRF1 and -TREF2 antibod- 
ies were obtained from J. Karlseder at Salk Institute. 

Immunofluorescence and TIF analysis. Immunofluorescence and TIF analysis 
assays for cells with different genotypes grown on coverslips were performed as 
described previously* using the primary antibody TRF1, TPP 1, 53BP1 or y-H2AX. 
Secondary antibodies against mouse or rabbit were labelled with Alexa 488 
(Molecular Probes). For the TIF assay, the same primary and secondary 
antibodies were used with a Tam-OO-(CCCTAA), PNA telomere probe 
(Applied Biosystems). DNA was counterstained with DAPI and slides were 
mounted in 90% glycerol, 10% PBS containing 1 1g ml! p-phenylene diamine 
(Sigma). Digital images were acquired and analysed as described’, using a Nikon 
Eclipse 800 microscope, X63 apo-plan objective and a CCD camera with a 5 


nature 


megapixel, 9 X 9 micron chip. Bias, dark and flat-field frames were taken for 
image processing. Only cells with =five 53BP1 or y-H2AX signals co-localized 
with telomere signals were scored. 

Telomere FISH and CO-FISH. Cells were collected at indicated time points and 
fixed; FISH and CO-FISH were performed as described previously*”’. 
Hybridization of metaphase spreads was performed with Tam-OO- 
(CCCTAA), peptide nucleic-acid probes (Applied Biosystems). For CO-FISH, 
metaphase spreads were incubated sequentially with 5’- Tam-OO-(CCCTAA)4- 
3’ and 5’-FITC-OO-(TTAGGG),-3’ probes. Control experiments were 
performed to ensure that the CO-FISH signals were dependent on the incorpora- 
tion of BrdU, and that the procedure did not generate signals owing to unintended 
denaturation of the telomeric DNA. A minimum of 500 chromosome ends were 
scored blindly for each genotype, and pairwise comparisons for statistical signifi- 
cance were made by Student’s t-test. Differences between genetic backgrounds 
were considered significant only when P values were less than 0.01. Digital Images 
were captured as described earlier and processed with MetaMorph Premier 
(Molecular Devices). 

In-gel telomeric G-overhang assay. In-gel G-overhang assays were performed 
essentially as described'’. After pulse-field gel electrophoresis, gels were dried 
down at 40°C and prehybridized at 50°C for 1h in Church mix (0.5M 
Na,HPO,, pH 7.2, 1 mM EDTA, 7% SDS and 1% BSA), followed by hybridiza- 
tion at 50 °C overnight with an end-labelled (CCCTAA), oligonucleotide. After 
hybridization, gels were washed three times with 4X SSC for 30 min and once 
with 4X SSC, 0.1% SDS. Gels were exposed to PhospholImager screens. After 
G-overhang assays, gels were alkali denatured (0.5 M NaOH and 1.5M NaC), 
neutralized (3M NaCl and 0.5M Tris-HCl, pH7.0), rinsed with H,O, and 
reprobed with labelled (TTAGGG), oligonucleotide at 55 °C and then processed 
as previously. To determine the relative overhang signal, the signal intensity for 
each lane was determined before and after denaturation using Imagequant soft- 
ware. The G-overhang signal was normalized to the total telomeric DNA and this 
normalized value was compared between samples. 
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Canadian changes in the air 


British Columbia aims to become a leader in clean-energy technology. Virginia Gewin tracks progress. 


he Canadian province of British 

Columbia has a simple strategy 

for becoming a hub of clean 

technology — be bold. Under its 
premier Gordon Campbell, the province has 
established aggressive clean-energy initiatives, 
including North America’ first carbon tax 
(see http://tiny.cc/HkvYn) to meet its goal of 
reducing greenhouse-gas emissions to 33% 
below 2007 levels by 2020. Carbon-based 
fuels will be taxed at a rate of Can$10/tonne 
(US$9.28) of carbon emissions now and 
at Can$30 per tonne by 2012. Campbell's 
government has also made reducing carbon 
emissions part of a wider plan to stimulate job 
growth in the region, where the traditional 
natural-resources industries of forestry 
and fisheries are struggling. From biofuels 
to wave energy, the province's government 
and the Canadian federal government have 
made sizeable investments in the research, 
development and commercialization of clean- 
energy alternatives to fossil fuels. 

The plan seems to be paying off. Early- 
stage companies are getting the funding they 
need to hire additional staff, and energy- 
related training opportunities abound. The 
big challenge, however, will be bringing these 
technologies to fruition. In the face of an 
uncertain market and the significant up-front 
costs of commercializing a new energy 
technology, British Columbia's leaders have 
encouraged investment in 
clean energy, “The real prize 
is becoming a ‘go-to’ place to 
sell technologies to the rest 
of the world, says Andrew 
Walls, director of the British 
Columbia Innovation 
Council's ocean sciences 
and energy programme. 

To complement the federally funded Can$1- 
billion Sustainable Development Technology 
Canada (SDTC) fund, devoted to developing 
clean energy, most notably biofuels, British 
Columbia has created its own clean-tech 
funds. The Can$25-million annual Innovative 
Clean Energy Fund (ICE) and the Can$25- 
million British Columbia Bioenergy Network 
(BCBN) emphasize help to companies that are 
developing biofuels. In 2008, the provincial 
government released a bioenergy strategy to 
help guide its energy plan. “This is one of the 
few sectors that has legs, and people actively 
developing technologies,” says Michael 
Weedon, executive director of the BCBN. 

Lignol, an emerging biofuel company 
based in Burnaby, has received funding 
from the ICE, the SDTC and the BCBN to 
support its efforts to produce ethanol from 


has legs.” 
— Michael Weedon 
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“Bioenergy is one of 
the few sectors that 


Sited in the southwestern corner of British Columbia, Vancouver is one of the region's skills centres. 


woody materials. It wants to recruit scientists, 
primarily those with biology and chemistry 
backgrounds, to explore and develop new 
technologies. Lignol’s chief operating officer 
Michael Rushton says the company looks 
for people with skills in chemical processing 
or enzyme chemistry as it continues to 
improve the efficiency of the 
conversion of lignocellulose 
into ethanol. 

Despite the emphasis 
on biofuels, there are also 
job opportunities in the 
areas of wave energy and 
carbon sequestration. In 
March, SyncWave, a company in Pemberton, 
received Can$2.7 million from the SDTC 
to field test its technology, which taps the 
energy of ocean swells. Chief executive Nigel 
Protter says he needs scientists and engineers 
who understand hydrodynamics and fluid 
dynamics. British Columbia will also be the 
site of one of eight Canada-wide projects that 
will share Can$140 million in government 
funds to conduct carbon-capture and storage 
demonstrations. Spectra Energy, based in 
Houston, Texas, which specializes in natural- 
gas infrastructure, will oversee the carbon- 
capture project in British Columbia from its 
offices in Vancouver and Calgary, Alberta. 
Tony Irwin, the company’s director of climate 
change and energy efficiency, says he is 
looking for people who can combine multiple 
skills, including chemistry, hydrogeology, 
policy and engineering. 
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Both the public and the private sectors are 
taking steps to ensure that British Columbia's 
human scientific resources match the clean- 
tech initiative. The provincial government 
has established the Leading Edge Endowment 
Fund, a Can$56.25-million joint government- 
private sector initiative to create 29 permanent 
endowed chairs at British Columbia 
universities and colleges in areas including 
health, technology and the environment. 

The hope is that 20 permanent leadership 
research chairs will attract top talent and that 
nine ‘regional innovation chairs’ will spur 
technology transfer at smaller colleges and 
technical institutes. 


Training grounds 

The Natural Sciences and Engineering 
Research Council of Canada’s Collaborative 
Research and Training Experience 
(CREATE) programme aims to fill 
interdisciplinary gaps. For example, the 
CREATE program in interdisciplinary 
climate science will partner with the 
Canadian Centre for Climate Modelling 
and Analysis at the University of Victoria 
with the Institute of Ocean Sciences on 
Vancouver Island, run by Fisheries and 
Oceans Canada. The programme will carry 
out research in areas crucial to industry and 
government, such as developing new ways 
of predicting the evolution of terrestrial and 
oceanic ecosystems, and climate diagnostics 
research to uncover signatures of climatic 
variation. Other CREATE programmes 
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will target British Columbia's 
health and life-sciences 
industries, and those devoted 
to the environment and energy 
include bioenergy, biorefining 
and biodiversity. Each 
programme receives Can$1.65 
million to split among stipends 
for undergraduate, graduate 
and postdoctoral students. 
Meanwhile, a new climate- 
science initiative should mean 
several million dollars in 
funding for graduate students, 
postdocs and visiting scientists. 
In March 2008, the Pacific 
Institute for Climate Solutions 
(PICS) was established with a 
Can$90-million endowment. 
PICS, a partnership between the 
University of British Columbia 
in Vancouver, the University 
of Victoria, Simon Fraser 
University in Burnaby and the 
University of Northern British 
Columbia in Prince George, 
will spend around Can$1 
million of its operating budget 
on training. Projects are still 


being planned but scientists will be needed 
to address how ecosystems and humans can 
adapt to climate change, says Nancy Olewiler, 


British Columbia premier 
Gordon Campbell (top), 
and innovation council head 
Andrew Walls see clean 
energy as the future. 


director of the public-policy 
programme at Simon Fraser 
University and a member 

of the PICS programme 
committee. She expects 

teams from each university to 
participate in interdisciplinary 
projects such as developing 
solutions to water shortage, 
tracking climate-induced 
emerging health issues and 
exploring how timber can 
meet bioenergy needs (see 
‘Bioenergy investment benefits 
cash-strapped life-sciences 
sector’). 

Formal partnerships with 
two recently established 
oceanography initiatives 
should extend PICS's 
interdisciplinary reach. The 
newly established Can$100- 
million North-East Pacific 
Time-Integrated Undersea 
Networked Experiments 
(Neptune Canada) and the 
Can$25-million Victoria 
Experimental Network Under 


el 


‘ 


the Sea (Venus) project, a 
deep-sea and coastal ocean observatory, will 
provide climate-science opportunities for 
oceanography graduate students and postdocs. 


They will have the task of sifting through an 
unprecedented variety and amount of data, 
which can be used, for example, to visualize 
the sea floor and monitor conditions such 
as oxygen depletion and ocean acidification, 
says Verena Tunnicliffe, professor of deep- 
sea research at the University of Victoria and 
Venus'’s project director. Tunnicliffe’s team is 
exploring ways to help oceanographers handle 
the huge amount of data generated by the 
Neptune and Venus programmes. She says the 
training possibilities for young scientists in 
British Columbia are endless. 

Connecting graduate research to 
burgeoning industries in British Columbia 


is the goal of the Accelerate programme, run 


by the mathematical and science research 
network MITACS, in Vancouver, which 
fosters graduate internships with potential 
employers in high-tech industry, government 
and non-profit organizations. The aim is 

to give companies, and in particular those 
in the clean-tech sector, access to cutting- 
edge university research, while students get 
a first-hand experience of what industry 
needs. “British Columbia is moving away 
from its history as a resource extractor,’ says 
Tunnicliffe, “to a region using technology to 
find exciting ways to use and develop these 


resources sustainably’ a 
Virginia Gewin is a freelance writer based in 
Portland, Oregon. 


The economic downturn is hitting 
Canadian life-science companies 
hard. On 15 July, BIOTECanada, 
Canada's biotechnology industry 
organization, released survey 
data suggesting that, including 
jobs already lost, 7,000 highly 
skilled researchers and scientists 
could be laid off throughout the 
country in the next year if short- 
term financing from private or 
government sources cannot 
be obtained to keep struggling 
companies afloat. 

Although British Columbia 
has a relatively mature 
biopharmaceutical sector that is 
better placed to weather the storm, 
the recession has had an impact, 
says Bob Ingratta, bioproducts 
and bioenergy sector specialist at 
the non-profit industry-support 
organization LifeSciences British 
Columbia, based in Vancouver. 

The Canadian government's 
investment in bioenergy and clean 
technology could offer a welcome 
respite, and create opportunities 
for those with genomics or 
bioinformatics skills. For example, 
Jorg Bohlmann, a forest biologist at 
the University of British Columbia 
in Vancouver, has a grant of 


Can$7.5 million (US$7 million) 
from Genome British Columbia, 
one of six centres that vie for funds 
from the central organization 
Genome Canada. 

Bohlmann is hunting for genomic 
clues to the outbreaks of mountain 
pine beetle that threaten Canada’s 
forests (pictured), and is exploring 
how the biofuels sector might use 
infested wood as biomass. He 
wants to train people in genomics, 
bioinformatics and ecological risk 
modelling. “If we can't truly stop 
forest impact of the pine mountain 
beetle, we may be able to better 
utilize forest resources that are 
infected,” he says. 

Pierre Meulien, Genome British 
Columbia's chief scientific officer, 
says that although Genome Canada 
did not receive any new funding in 
this year's budget, it is funded up 
to the end of 2013. Genome British 
Columbia also receives money 
from the provincial government 
specifically for research on local 
priorities such as bioenergy. For 
example, researchers funded by 
Genome British Columbia at the 
University of British Columbia 
in Vancouver and the University 
of Victoria are investigating how 


to maximize the fermentation of 
lignocellulose to bioethanol, using 
poplars. 

The Canada Foundation for 
Innovation has provided Can$71 
million in funding for 16 major 
infrastructure projects in British 
Columbia, with particular 
emphasis on ocean research 
and the life sciences. The focus 
is on maintaining cutting-edge 
research platforms in life sciences, 
including a Can$10 million ultra- 
high-throughput DNA sequencing 
platform for large-scale genome 
analysis at the University of British 


BIOENERGY INVESTMENT BENEFITS CASH- sadist LiFe ey SECTOR 


Columbia. Such investments are 
considered key to maintaining 
British Columbia's success. 
Christoph Borchers, director of 
the University of Victoria-Genome 
British Columbia Proteomics 
Centre, sees ample promise in 
the province's diverse portfolio 
of genomics research despite 
funding woes. “Genome British 
Columbia has a lot of funding 
from the provincial government, 
which wants to see the community 
broadly apply ‘omics to all areas 
that affect human health from trees 
to fisheries,” he says. V.G. 
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Bombs away! 


Happy landings. 


Paul Di Filippo 


Having left the McConnell Air Force Base 
in Kansas just a few hours earlier, the squad- 
ron of long-range B-5 ‘Shelly O’ Stealth 
bombers arrived over Igboland in south- 
eastern Nigeria at 3:13 a.m., local time. The 
air defences of the reclusive and hostile 
dictatorship (a failed state since the collapse 
of the global petroleum industry after the 
advent of microbe-generated electricity 
from trash) could not detect the invaders. 

The payloads unleashed by the bombers, 
however, were a different matter. 

Each package measured some 4 cubic 
metres, big as a freestanding urban street 
toilet, swaddled in protective foam and 
with a chute on top. 

Soon, mushroom-like synthetic blooms 
dotted the night sky all over Igboland. 

Nigerian troops scrambled to meet their 
descent. 

Each package, as it touched down in 
countryside, town or city, automatically 
jettisoned its self-destructing foam coat- 
ing and parachute, removing evidence of 
the landing. 

Revealed was what, indeed, appeared 
to be an urban street-toilet: a shed-sized, 
streamlined plastic structure, windowless, 
with a curving door panel. 

In 90% of the landings, soldiers arrived 
on the scene first, surrounding the 
structures menacingly, weapons raised, 
until military trucks arrived to haul the 
invaders away. 

Occasionally, average citizens reached 
the bombs first. The finders generally 
cooperated. They sought to shift the struc- 
tures out of sight of the authorities. But 
sometimes fights erupted, or pirate bands 
intervened. For the most part, unless the 
citizens moved very fast, the soldiers soon 
showed up and took the prizes away, bru- 
tally and with bloodshed. 

But in a very small number of instances, 
the bombs passed safely and secretly into 
the hands of non-state individuals. 


A young, orphaned bachelor, Okoronkwo 
Mmadufo grew pear! millet and raised 
goats on the edge of an abandoned and 
decaying Chinese coltan-processing plant; 
land no one else coveted as it was seeded 
with toxic waste. His farm struggled to 
provide even one person with a subsist- 
ence living. The soil made his crops sick 
and the vegetation did the same to his ani- 
mals. Okoronkwo despaired of ever being 
rich enough to afford a wife and family. 


The night of the bombing run the 
farmer was awake, tending a sick 
goat. He looked up when he heard 
a muffled but sizable thump, and 
saw the bomb settle atop a patch of 
scrawny millet plants. He left the goat 
and rushed to the structure. 

He began to push futilely at the 
big bomb, which was nearly as 
large as his house. But then he saw 
a large red unlabelled button near 
the door panel, and he slapped it. 

The bomb lifted itself up on a set 
of wheels and an air-cushion effect. 

Okoronkwo ran with the bomb 
towards the deserted, ruined fac- 
tory. A small outbuilding looked 
impenetrably collapsed upon itself. But 
Okoronkwo knew the secret of its access. 

He moved some timbers and hauled 
aside a wall of galvanized tin and got the 
bomb hidden. After grabbing a branch, he 
erased any slight tracks leading back to the 
landing site. 

The soldiers found him cradling his sick 
goat. 

After interrogation and discussion, the 
soldiers decided not to investigate the 
abandoned plant, as they had heard that 
the effect of the toxic waste would be to 
cause their penises to disappear. They had 
much sport speculating on Okoronkwos 
genital shrinkage, then left. 

Okoronkwo waited until the next night to 
investigate the bomb in the outbuilding. 

When the curving plastic portal opened, 
light flooded the interior of the bomb. 
Okoronkwo quickly stepped inside and 
shut the door. 

The interior of the structure was much 
smaller than he expected, indicating con- 
cealed machinery or reservoirs. The only 
visible features were an intake hopper, a 
dispensing chute and a docked cellphone. 

Okoronkwo picked up the phone and it 
came alive. 

Speckled with animated glyphs, the face 
of a young white guy appeared. 

“Sticky here. What's your name?” 

“Okoronkwo Mmadufo?” 

“Gonna call you OM. Here's the tranche. 
You're now the proud owner of a Biofab 
Field Unit. A BFU. It comes supplied with 
feedstocks — just common stuff you'll be 
able to replace — and smart microbes that 
will handle their own reproduction, as 
well as diagnostic, engineering and inter- 
face instrumentation. PCR, nucleotide 
decouplers and linkers, sequencers — the 
works. You can use the BFU to make nearly 
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any medicine or other 
products of any natural or 
synthetic organic processes. The unit will 
tailor doses of active agents, as well, for 
dispersal into the environment. You run 
everything via the cellphone. You'll see 
the control panel now on the touchscreen, 
with a link to an interactive tutorial. Click 
on the terms of agreement, please, OM — 
Swell! Goodbye” 

“Wait! Ihave many questions!” 

“Sorry, the feds aren't paying me to 
answer questions. Strictly freelance. So, ’m 
gone. Unless — can you get me any rare 
highlife recordings?” 

“You like live shows of Dr Sir Warrior?” 

“Hell yeah!” 

“T can get those” 

“Bring me tracks I don't have, and ’m 
yours to command? 

Over the next week, Okoronkwo and 
his new friend used the BFU to tailor a 
remediation treatment for the soil, a cure 
for pearl millet top rot, and nutraceuticals 
for the goats. 

Okoronkwo came to feel confident in 
his prowess with the BFU, and eventually 
bade Sticky goodbye. He knew now that 
he could continue to help himself and his 
neighbours, and that his personal future 
would include a woman and children. 

But first he had to tailor a certain lethal 
smart bug, keyed only to the genomes of 
Nigeria's rulers. These men were lax with 
their condom use, and obtaining their seed 
would be no chore at all. a 
Having celebrated his thirtieth year as a 
professional writer, Paul Di Filippo 
believes he still detects potential for 
self-improvement. 

Join the discussion of Futures in Nature at 
http://tinyurl.com/kkh3kt 
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